On the Distribution of Numbers


By R. W. HAMMING 
(Manuscript received March 17, 1970) 


This paper examines the distribution of the mantissas of floating point 
numbers and shows how the arithmetic operations of a computer transform 
various distributions toward the limiting distribution 


\[ r(x) = \frac{1}{x \ln b}, \qquad 1/b \le x \le 1 \]

(where b is the base of the number system). The paper also gives a number
of applications to hardware, software, and general computing which show 
that this distribution is not merely an amusing curiosity. A brief exami- 
nation of the distribution of exponents is included. 


I. INTRODUCTION 


The main purpose of this paper is to examine, from the computing 
machine’s point of view, the well-known (to comparatively few people) 
unequal distribution of the "mantissas" of "naturally occurring" sets
of numbers. The observed probability density distributions are often 
close to the reciprocal density distribution 


\[ r(x) = \frac{1}{x \ln b}, \qquad 1/b \le x \le 1, \tag{1} \]


where b is the number base (usually 2, 8, 10, or 16). The corresponding
cumulative probability distribution is 





\[ R(t) = \int_{1/b}^{t} \frac{dx}{x \ln b} = \frac{\ln t + \ln b}{\ln b}, \tag{2} \]


where, of course,

\[ R(1/b) = 0 \quad \text{and} \quad R(1) = 1. \]


From the cumulative distribution, it follows that the probability of 
observing the leading digit N of a number that is drawn at random 
from r(t) is 


\[ R\!\left(\frac{N+1}{b}\right) - R\!\left(\frac{N}{b}\right) = \frac{\ln (N+1) - \ln N}{\ln b} \tag{3} \]


and this is usually what is measured in experiments. 
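
The leading-digit probabilities of equation (3) are easy to tabulate. The following is a minimal sketch in Python; the function names are hypothetical and are not part of the paper. For base 10 and a sample of 50 numbers, the rounded values agree with the expected column of Table I.

```python
import math

def leading_digit_probability(n, base=10):
    """Probability of leading digit n under the reciprocal density 1/(x ln b)."""
    return (math.log(n + 1) - math.log(n)) / math.log(base)

def expected_counts(total, base=10):
    """Expected number of cases for each leading digit in a sample of `total`."""
    return {n: total * leading_digit_probability(n, base) for n in range(1, base)}

if __name__ == "__main__":
    for digit, count in expected_counts(50).items():
        print(f"{digit}: {count:5.2f}")
```

The probabilities sum to 1 for any base, since the logarithms telescope.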

A typical experiment is that of tabulating the number of physical 
constants in a table having a given leading digit (see Table I and Ref. 1, 
p. 7). The result looks reasonable. Many other examples of observing 
the reciprocal distribution have been reported. For references see 
Refs. 2 and 3. 

TABLE I—THE DISTRIBUTION OF THE LEADING
DIGITS OF 50 PHYSICAL CONSTANTS

                    Number of cases    Expected number
Leading digit N        observed            eq. (3)        Difference
      1                   16                 15                1
      2                   11                  9                2
      3                    2                  6               -4
      4                    5                  5                0
      5                    6                  4                2
      6                    4                  3                1
      7                    2                  3               -1
      8                    1                  3               -2
      9                    3                  2                1


The reciprocal distribution has been explained in many ways. One
popular but not immediately obvious explanation for the distribution of
physical constants is as follows. Consider the distribution of the leading
digits of the set of all the physical constants that might occur. If the
units of measurement were to be changed then the corresponding leading 
digit of any particular physical constant would probably change, but 
it is difficult to believe that the distribution itself would change sig- 
nificantly. To believe so seems to indicate a belief that either the present 
units of measurement or else the new set have some intimate connection 
with the real world. An alternative, and more elegant, explanation is 
given by Roger Pinkham in his classic paper (Ref. 2). The explanation 
given in the present paper is based on how the computer transforms 
distributions during arithmetic operations. In particular the paper shows 
how, from any reasonable distributions, repeated multiplications and/or 
divisions rapidly move the distributions toward the reciprocal dis- 
tribution. The effect for addition and subtraction is somewhat different. 
The paper also shows the persistence of the reciprocal distribution
once it is attained. 

Since floating point numbers are the basis of most of numerical 
analysis one may well ask why this obvious and experimentally well- 
verified distribution is so often ignored. Is it because it appears to 
contradict the usually accepted model of the number system in which 
numbers correspond to points on a homogeneous straight line? Not only 
are the floating point numbers not uniformly spaced in a computer 
(the difference between the two largest possible numbers is very large, 
while the distance between the two smallest positive numbers is very
small, and zero is relatively isolated), but the reciprocal distribution 
shows that even in intervals in which the numbers are equally spaced 
they are not equally likely to occur. 

Thus in analogy with non-Euclidean geometry this paper proposes 
an alternative to the conventional identification of numbers with points 
on a homogeneous straight line. Instead of adopting a measure for sets 
that is invariant under translation 


\[ x' = x + k, \]

we often prefer a measure that is invariant under scaling, namely

\[ x' = kx \qquad (k \ne 0). \]


The reciprocal distribution is of practical as well as theoretical interest 
as we shall show in Section VII. In view of these examples, it is hoped 
that by adopting the machine’s point of view with respect to how 
numbers are transformed by arithmetical operations, the computer 
scientists will become more aware of the importance of this distribution 
in many situations including numerical analysis. 
II. THE MODEL


The floating point numbers in a computing machine form a discrete, 
finite set. As is true in so many applications of mathematics to practical 
problems, we shall approximate a discrete distribution by a continuous 
one of sufficient smoothness. Anyone familiar with the upper and lower 
Riemann Integral sums can appreciate the degree of approximation 
being made (provided common sense is used in choosing the values of 
the curve between the given points). In the limit of the Riemann sum 
all the $|\Delta x_i|$ become less than any given $\varepsilon > 0$; we of course need to
stop at the granularity of the number system used, typically 10~° or 
smaller. 

In principle, it is possible to carry this error estimate throughout all 
the subsequent steps of the mathematics to see how much the mathe- 
matics errs from reality; but it is customary to recognize that a little 
intuition will suffice to convince the user that the error will be much less 
than the accuracy of the experiments that the theory is designed to 
account for. Thus we have no need to get excited about such things as 
the Banach measure of a set (Ref. 4); we do not intend in this paper to
let the mathematics obscure what is going on. The fact that computers 
are finite and operate at a finite speed for a finite length of time spares us 
from taking seriously all the confusions that can arise in mathematics 
when dealing with the infinite. 


III. THE BASIC FORMULAS 


In this section we derive the basic formulas which describe how distributions
are combined and transformed by the four arithmetic operations
of a computer. Let f(x) be the density distribution of the factor x,
g(y) be the density distribution of the factor y, and h(z) be the density
distribution of the result z of the arithmetic operation. Further, let
F(x), G(y), and H(z) be the corresponding cumulative distributions.

For both multiplication and division, the mantissas are directly 
combined and the exponents do not enter into the formation of the 
distribution of the result of the operation. Thus, it is sufficient in these 
cases to consider the distributions for $(1/b \le x, y \le 1)$.

For multiplication, an examination of Fig. 1 shows that when the 
product falls in the shaded regions then the mantissa of the product is 
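in the interval (1/b, z). Thus the cumulative distribution H(z) is given by

\[
\begin{aligned}
H(z) &= \int_{1/b}^{z} \int_{1/b}^{z/bx} f(x) g(y) \, dy \, dx
      + \int_{1/b}^{z} \int_{1/bx}^{1} f(x) g(y) \, dy \, dx
      + \int_{z}^{1} \int_{1/bx}^{z/x} f(x) g(y) \, dy \, dx \\
     &= \int_{1/b}^{z} f(x) \left[ G(z/bx) - G(1/b) + G(1) - G(1/bx) \right] dx
      + \int_{z}^{1} f(x) \left[ G(z/x) - G(1/bx) \right] dx .
\end{aligned}
\]

Differentiating with respect to z to get the density distribution we have

\[
\begin{aligned}
h(z) &= f(z) \left[ G(1/b) - G(1/b) + G(1) - G(1/bz) - G(1) + G(1/bz) \right] \\
     &\quad + \int_{1/b}^{z} f(x) \, g(z/bx) \, \frac{1}{bx} \, dx
            + \int_{z}^{1} f(x) \, g(z/x) \, \frac{1}{x} \, dx \\
     &= \frac{1}{b} \int_{1/b}^{z} \frac{f(x)}{x} \, g(z/bx) \, dx
      + \int_{z}^{1} \frac{f(x)}{x} \, g(z/x) \, dx .
\end{aligned}
\tag{4}
\]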


Similarly for division. The shaded region of Fig. 2 shows where the 
quotient x/y is less than z; thus the cumulative distribution for the 
quotient is 


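\[
\begin{aligned}
H(z) &= \int_{1/b}^{z} \int_{1/b}^{x} f(x) g(y) \, dy \, dx
      + \int_{1/b}^{z} \int_{x/z}^{1} f(x) g(y) \, dy \, dx
      + \int_{z}^{1} \int_{x/bz}^{x} f(x) g(y) \, dy \, dx \\
     &= \int_{1/b}^{z} f(x) \left[ G(x) - G(1/b) + G(1) - G(x/z) \right] dx
      + \int_{z}^{1} f(x) \left[ G(x) - G(x/bz) \right] dx .
\end{aligned}
\]


Fig. 1—The cumulative probability distribution for the product z = xy.

Fig. 2—The cumulative probability distribution for the quotient z = x/y.


Again differentiating with respect to z to get the density distribution
we have

\[
\begin{aligned}
h(z) &= f(z) \left[ G(z) - G(1/b) + G(1) - G(1) - G(z) + G(1/b) \right] \\
     &\quad + \int_{1/b}^{z} f(x) \left[ -g(x/z) \right] \left( -\frac{x}{z^2} \right) dx
            + \int_{z}^{1} f(x) \left[ -g(x/bz) \right] \left( -\frac{x}{b z^2} \right) dx \\
     &= \frac{1}{z^2} \int_{1/b}^{z} x f(x) \, g(x/z) \, dx
      + \frac{1}{b z^2} \int_{z}^{1} x f(x) \, g(x/bz) \, dx .
\end{aligned}
\tag{5}
\]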


For both addition and subtraction the difference in the exponents of 
the two numbers x and y is used to shift one mantissa with respect to 
the other before they are combined. For addition, we may suppose that 
one of the numbers, say x, lies in the range $z/2 \le x \le z$. The other
term, y, therefore lies in the range $z/2 \ge y \ge z b^{-k}$, where k is the
number of digits in the mantissa and we set $b^{-k} = \varepsilon$. Thus the density
distribution of the sum is

\[ h(z) = \int_{z/2}^{z(1-\varepsilon)} f(x) \, g(z - x) \, dx . \tag{6} \]

For subtraction we suppose, without loss of generality, that $x \ge y > 0$,
and

\[ z = x - y \]

with $z \le x \le z/\varepsilon$. Then the density distribution is given by

\[ h(z) = \int_{z}^{z/\varepsilon} f(x) \, g(z + x) \, dx . \tag{7} \]


We have now derived the basic relations for the density distributions 
that arise from combining two numbers from arbitrary distributions 
according to the four arithmetic operations of a computer. 


IV. THE PERSISTENCE OF THE RECIPROCAL DISTRIBUTION 


In this section, we first show for both multiplication and division 
that if one of the factors x or y comes from the reciprocal distribution, 
and regardless of the distribution of the other factor, then h(z) is the 
reciprocal distribution. In particular, if a number is chosen from the 
reciprocal distribution, then its reciprocal is also from the reciprocal 
distribution. For addition and subtraction we show somewhat less. 

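For the product set

\[ g(y) = \frac{1}{y \ln b} \tag{8} \]

in equation (4). We get for any distribution f(x)

\[
\begin{aligned}
h(z) &= \frac{1}{b} \int_{1/b}^{z} \frac{f(x)}{x} \, \frac{bx}{z \ln b} \, dx
      + \int_{z}^{1} \frac{f(x)}{x} \, \frac{x}{z \ln b} \, dx \\
     &= \frac{1}{z \ln b} \left[ \int_{1/b}^{z} f(x) \, dx + \int_{z}^{1} f(x) \, dx \right]
      = \frac{1}{z \ln b} .
\end{aligned}
\tag{9}
\]

Obviously since z = xy, the same applies if we assume that f(x) is the
reciprocal distribution.
For the quotient, again assume equation (8) and use it in equation (5).

\[
\begin{aligned}
h(z) &= \frac{1}{z^2} \int_{1/b}^{z} x f(x) \, \frac{z}{x \ln b} \, dx
      + \frac{1}{b z^2} \int_{z}^{1} x f(x) \, \frac{bz}{x \ln b} \, dx \\
     &= \frac{1}{z \ln b} \left[ \int_{1/b}^{z} f(x) \, dx + \int_{z}^{1} f(x) \, dx \right]
      = \frac{1}{z \ln b} .
\end{aligned}
\tag{10}
\]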


In the special case of f(x) being the "spike distribution" with all of its
probability at x = 1 we see that the reciprocal of a variable having the 
reciprocal distribution has the reciprocal distribution. The case of x 
having the reciprocal distribution and producing the reciprocal distri- 
bution, regardless of the distribution of the denominator, is covered by 
the product form, or can be worked out directly if desired. 

Thus, if in a long sequence of multiplications and divisions at least one
factor has the reciprocal distribution, then regardless of how the distributions
of the other factors are chosen the result is still the reciprocal
distribution; the reciprocal distribution persists under multiplication 
and division and cannot be broken by any choices for the other factors. 
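
The persistence result can be checked numerically. The following is a minimal Monte Carlo sketch, assuming base 10, a flat distribution for the "other" factor, and hypothetical helper names (none of these choices come from the paper). One factor is drawn from the reciprocal distribution, the product or quotient is renormalized to a mantissa, and the leading-digit frequencies are compared with equation (3).

```python
import math
import random

def sample_reciprocal(rng, base=10):
    """Draw a mantissa in [1/base, 1) with density 1/(x ln base)."""
    return base ** (rng.random() - 1.0)

def normalize(value, base=10):
    """Shift a positive product or quotient back into [1/base, 1)."""
    while value < 1.0 / base:
        value *= base
    while value >= 1.0:
        value /= base
    return value

def leading_digit(mantissa, base=10):
    """Leading base-`base` digit of a mantissa in [1/base, 1)."""
    return max(1, min(base - 1, int(mantissa * base)))

def digit_frequencies(n, operation="product", base=10, seed=1):
    """Observed leading-digit frequencies of x*y or y/x, x reciprocal, y flat."""
    rng = random.Random(seed)
    counts = [0] * base
    for _ in range(n):
        x = sample_reciprocal(rng, base)
        y = rng.uniform(1.0 / base, 1.0)   # arbitrary choice for the other factor
        result = x * y if operation == "product" else y / x
        counts[leading_digit(normalize(result, base), base)] += 1
    return [c / n for c in counts]

if __name__ == "__main__":
    observed = digit_frequencies(100_000)
    for d in range(1, 10):
        print(d, round(observed[d], 4), round(math.log10(1 + 1 / d), 4))
```

Replacing the flat factor by any other distribution on the mantissa range should leave the observed frequencies unchanged up to sampling noise, which is the content of equations (9) and (10).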

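For addition let x come from the reciprocal distribution for some range
with normalization factor $N_1$, and y also come from a reciprocal distribution
with its corresponding range and normalization factor $N_2$.
Then writing $\varepsilon = b^{-k}$

\[
\begin{aligned}
h(z) &= \int_{z/2}^{z(1-\varepsilon)} \frac{N_1}{x} \, \frac{N_2}{z - x} \, dx \\
     &= \frac{N_1 N_2}{z} \int_{z/2}^{z(1-\varepsilon)} \left[ \frac{1}{x} + \frac{1}{z - x} \right] dx \\
     &= \frac{N_1 N_2}{z} \left[ \ln \frac{x}{z - x} \right]_{z/2}^{z(1-\varepsilon)} \\
     &= \frac{N_3}{z},
\end{aligned}
\tag{11}
\]

where $N_3$ is some constant.
Similarly for subtraction (different $N_3$)

\[
\begin{aligned}
h(z) &= \int_{z}^{z/\varepsilon} \frac{N_1}{x} \, \frac{N_2}{z + x} \, dx \\
     &= \frac{N_1 N_2}{z} \int_{z}^{z/\varepsilon} \left[ \frac{1}{x} - \frac{1}{z + x} \right] dx \\
     &= \frac{N_1 N_2}{z} \left[ \ln \frac{x}{z + x} \right]_{z}^{z/\varepsilon} \\
     &= \frac{N_3}{z}.
\end{aligned}
\tag{12}
\]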


It should be noted, however, that in the last two cases the assumption 
of the reciprocal distribution for such great ranges is suspicious to say 
the least, since we know from experience that all exponents are not 
equally likely. That the reciprocal distribution over a large range implies 
the equally likely distribution of the relevant exponents can be seen by 
examining the base 16 number system in exponents, but where the 
mantissas are in binary. Thus the mantissas can have one of the forms: 


    0.1xxx...
    0.01xx...
    0.001x...
    0.0001...

If we assume

\[ p(x) = \frac{1}{x \ln 16}, \qquad 1/16 \le x \le 1, \]

what are the probabilities of each of the four forms? For the first one

\[ \int_{1/2}^{1} \frac{dx}{x \ln 16} = \frac{\ln 2}{4 \ln 2} = \frac{1}{4}. \]

Similarly, each of the others is 1/4. This result is quite different from that
of the flat distribution (see Table II).
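
The two probability columns of Table II follow directly from the two densities. The sketch below is illustrative only and uses hypothetical function names: it integrates the flat density exactly with rational arithmetic and evaluates the reciprocal density in closed form over the four binary ranges of a base 16 mantissa.

```python
import math
from fractions import Fraction

def flat_probability(lo, hi, base=16):
    """Probability of [lo, hi] under the flat density on [1/base, 1]."""
    return (hi - lo) / (1 - Fraction(1, base))

def reciprocal_probability(lo, hi, base=16):
    """Probability of [lo, hi] under the reciprocal density 1/(x ln base)."""
    return math.log(float(hi / lo)) / math.log(base)

# The four ranges of Table II, from three leading binary zeros down to none.
RANGES = [
    (Fraction(1, 16), Fraction(1, 8)),
    (Fraction(1, 8), Fraction(1, 4)),
    (Fraction(1, 4), Fraction(1, 2)),
    (Fraction(1, 2), Fraction(1, 1)),
]

if __name__ == "__main__":
    for lo, hi in RANGES:
        print(lo, hi, flat_probability(lo, hi), reciprocal_probability(lo, hi))
```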


V. THE APPROACH TO THE RECIPROCAL DISTRIBUTION 


Having shown that once it arises the reciprocal distribution persists 
for multiplication and division, we need to show how it can arise. For 
this we need a measure of how far a distribution h(z) is from the reciprocal
distribution r(z). It is obvious that

\[ \int_{1/b}^{1} \left[ h(z) - r(z) \right] dz = 0 \tag{13} \]

for any h(z) and this does not provide a useful measure of distance. We
shall define the distance of h(z) from the reciprocal distribution r(z) by

\[ D\{h(z)\} = D\{h\} = \max_{1/b \le z \le 1} \left| \frac{h(z) - r(z)}{r(z)} \right| , \tag{14} \]

which measures the maximum of the difference relative to the reciprocal
distribution (it is natural to use the relative error when dealing with
floating point numbers).








TABLE II—PROBABILITY OF OBSERVING MANTISSAS WITH LEADING
ZEROS IN BASE 16 NUMBERS WHEN WRITTEN IN BASE 2

                                                   Probabilities
Binary Form     Range                Exponent     Flat     Reciprocal
0.0001....      1/16 ≤ x ≤ 1/8          -3        1/15        1/4
0.001x....      1/8 ≤ x ≤ 1/4           -2        2/15        1/4
0.01xx....      1/4 ≤ x ≤ 1/2           -1        4/15        1/4
0.1xxx....      1/2 ≤ x ≤ 1              0        8/15        1/4


We showed in equation (9) that for a product,


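\[ r(z) = \frac{1}{b} \int_{1/b}^{z} \frac{f(x)}{x} \, r(z/bx) \, dx + \int_{z}^{1} \frac{f(x)}{x} \, r(z/x) \, dx . \]

Subtracting this from equation (4) and dividing by r(z) we have

\[
\frac{h(z) - r(z)}{r(z)}
  = \frac{1}{b} \int_{1/b}^{z} \frac{f(x)}{x} \left[ \frac{g(z/bx) - r(z/bx)}{r(z)} \right] dx
  + \int_{z}^{1} \frac{f(x)}{x} \left[ \frac{g(z/x) - r(z/x)}{r(z)} \right] dx .
\]

But

\[ bx \, r(z) = \frac{bx}{z \ln b} = r(z/bx), \qquad x \, r(z) = \frac{x}{z \ln b} = r(z/x), \]

and we have

\[
\frac{h(z) - r(z)}{r(z)}
  = \int_{1/b}^{z} f(x) \left[ \frac{g(z/bx) - r(z/bx)}{r(z/bx)} \right] dx
  + \int_{z}^{1} f(x) \left[ \frac{g(z/x) - r(z/x)}{r(z/x)} \right] dx .
\tag{15}
\]

Since $f(x) \ge 0$ for $(1/b \le x \le 1)$,

\[
\left| \frac{h(z) - r(z)}{r(z)} \right|
  \le \int_{1/b}^{z} f(x) \, D\{g\} \, dx + \int_{z}^{1} f(x) \, D\{g\} \, dx
  = D\{g\}
\]

for all z. From this it follows that

\[ D\{h\} \le D\{g\} \tag{16} \]

regardless of the choice of f(x).

We note that the equality would hold if f(x) were a single spike at
x = 1, say, but that in view of equation (13), we generally expect a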
great deal of cancellation in the square brackets of equation (15) as it 
is integrated over the range. 

It is easy to examine the rapidity of the approach in the case of all the 
factors coming from the flat distribution 


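\[ f(x) = g(y) = \frac{1}{1 - 1/b} = \frac{b}{b - 1} . \]

Equation (4) gives for two factors

\[
\begin{aligned}
h(z) &= \frac{1}{b} \left( \frac{b}{b-1} \right)^{2} \int_{1/b}^{z} \frac{dx}{x}
      + \left( \frac{b}{b-1} \right)^{2} \int_{z}^{1} \frac{dx}{x} \\
     &= \frac{b}{(b-1)^2} \left[ \ln b - (b - 1) \ln z \right] .
\end{aligned}
\]

In the base b = 10, this is

\[ h(z) = \frac{10}{81} \left\{ \ln 10 - 9 \ln z \right\}, \tag{17} \]

which (for the proper range) is given by Ref. 5 (p. 37). The distance of
the flat distribution is

\[ \max_{1/10 \le z \le 1} \left| \frac{10 z \ln 10}{9} - 1 \right| = \frac{10 \ln 10}{9} - 1 = 1.558 \ldots , \]

while the distance of equation (17) is equal to 0.3454... . See Table III
for further results.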
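TABLE III—THE DISTANCE OF A CONTINUED PRODUCT AS A
FUNCTION OF THE NUMBER OF FACTORS SELECTED
FROM A FLAT DISTRIBUTION

Number of Factors     Distance
        1              1.558
        2              0.3454
        3              0.0980
        4              0.0289


Similarly for division using equations (10) and (5), we have

\[
\frac{h(z) - r(z)}{r(z)}
  = \frac{1}{z^2} \int_{1/b}^{z} x f(x) \left[ \frac{g(x/z) - r(x/z)}{r(z)} \right] dx
  + \frac{1}{b z^2} \int_{z}^{1} x f(x) \left[ \frac{g(x/bz) - r(x/bz)}{r(z)} \right] dx .
\]

But

\[ \frac{z^2 r(z)}{x} = r(x/z), \qquad \frac{b z^2 r(z)}{x} = r(x/bz), \]

and we have

\[
\left| \frac{h(z) - r(z)}{r(z)} \right|
  \le D\{g\} \left\{ \int_{1/b}^{z} f(x) \, dx + \int_{z}^{1} f(x) \, dx \right\}
\]

or

\[ D\{h\} \le D\{g\}. \]

In the case of flat distributions

\[ h(z) = \frac{b}{2(b-1)} \left[ 1 + \frac{1}{b z^2} \right] , \]

which for the base 10 is (see Ref. 5, p. 37)

\[ h(z) = \frac{1}{18} \left( 10 + \frac{1}{z^2} \right) \]

and has a distance of 0.4071... .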
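
The distances quoted for flat factors can be reproduced by evaluating definition (14) on a fine grid. The sketch below is a numerical check only, assuming base 10 and hypothetical function names; the product and quotient densities are the closed forms obtained by putting f = g = b/(b - 1) into equations (4) and (5).

```python
import math

def reciprocal_density(z, base=10):
    """The reciprocal density r(z) = 1/(z ln b)."""
    return 1.0 / (z * math.log(base))

def flat_density(z, base=10):
    """The flat density b/(b - 1) on [1/b, 1]."""
    return base / (base - 1.0)

def product_density(z, base=10):
    """Mantissa density of a product of two flat factors, from eq. (4)."""
    return base / (base - 1.0) ** 2 * (math.log(base) - (base - 1.0) * math.log(z))

def quotient_density(z, base=10):
    """Mantissa density of a quotient of two flat factors, from eq. (5)."""
    return base / (2.0 * (base - 1.0)) * (1.0 + 1.0 / (base * z * z))

def distance(density, base=10, steps=20_000):
    """Grid estimate of D{h} = max |h(z) - r(z)| / r(z) over 1/b <= z <= 1."""
    lo = 1.0 / base
    worst = 0.0
    for i in range(steps + 1):
        z = lo + (1.0 - lo) * i / steps
        worst = max(worst, abs(density(z, base) / reciprocal_density(z, base) - 1.0))
    return worst

if __name__ == "__main__":
    print(distance(flat_density), distance(product_density), distance(quotient_density))
```

In all three cases the maximum is attained at an end point of the range, so a coarse grid that includes z = 1/b and z = 1 already gives the quoted figures.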

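For addition we select g(y) as a reciprocal distribution (with suitable
normalization factor N), subtract the corresponding equations and
divide by r(z) to get

\[
\begin{aligned}
\frac{h(z) - r(z)}{r(z)}
  &= \int_{z/2}^{z(1-\varepsilon)} \left[ f(x) - \frac{N_1}{x} \right] \frac{N_2}{z - x} \, \frac{dx}{r(z)} \\
  &= \int_{z/2}^{z(1-\varepsilon)} \left[ \frac{f(x) - r(x)}{r(x)} \right] \frac{N_1 N_2}{x (z - x)} \, \frac{dx}{r(z)} .
\end{aligned}
\]

But by the mean value theorem for integrals

\[
\frac{h(z) - r(z)}{r(z)}
  = \frac{f(\theta) - r(\theta)}{r(\theta)} \int_{z/2}^{z(1-\varepsilon)} \frac{N_1 N_2}{x (z - x)} \, \frac{dx}{r(z)} ,
\]

where $z/2 \le \theta \le z(1 - \varepsilon)$. The integral has been shown in equation (11)
to be exactly 1. Hence

\[ D\{h(z)\} \le D\{f\}. \]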


A similar derivation works for subtraction. 

In view of the dubious assumption of having the reciprocal distribu- 
tion over a very large range we need to examine more carefully the 
behavior of the mantissas of sums of numbers selected from some 
distribution. Let us imagine a Monte Carlo experiment. We select 
numbers from the range $(0 < a \le x \le b)$ having the probability density
distribution p(x) with mean $\mu$ and variance $\sigma^2$. Divide the range into
n equal intervals

\[ (a, a + h), \; (a + h, a + 2h), \; \ldots, \; [a + (n - 1)h, b], \]

where h = (b - a)/n. By counting how many numbers occur in each
interval we get estimates of p(x).
Let us add $2^m$ numbers of this set of numbers. The range for the sum is

\[ (2^m a, \; 2^m b), \]

the mean $\mu_s = 2^m \mu$ and $\sigma_s^2 = 2^m \sigma^2$. But the central limit theorem says
that the distribution of the sum approaches a normal distribution about
the mean with half width $\sigma_s$. Suppose, for convenience, that $\mu$ fell in
the middle of an interval. Then as m increases and we count the number
of cases of mantissas in each interval (note that the m in the term $2^m$
appears in the exponent only) we will find more and more of them will
fall in the interval containing $\mu_s$ (which has the same mantissa as $\mu$);
the distribution approaches a spike! This does not contradict the central
limit theorem; it merely says that if $\mu \ne 0$ ($\mu = 0$ is the exceptional case),
the distribution contracts as seen from the point of view of floating
point numbers. In loose words, standing at the origin and viewing the
rapidly receding mean $\mu_s$, the width of the distribution $\sigma_s$ seems to get
narrower as compared to the sum: the sum recedes as $2^m$, the half
width changes as $2^{m/2}$.
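
The contraction described in this Monte Carlo experiment can be imitated directly. The following is a minimal sketch, assuming (for illustration only) that the numbers are drawn uniformly from the range (1, 2) and using hypothetical function names; it estimates the ratio of the half width of the sum to its mean and compares it with the predicted value, which falls off as $2^{-m/2}$.

```python
import math
import random
import statistics

def relative_half_width(m, trials=2000, seed=7):
    """Estimate sigma_s / mu_s for sums of 2**m draws from uniform(1, 2)."""
    rng = random.Random(seed)
    count = 2 ** m
    sums = [sum(rng.uniform(1.0, 2.0) for _ in range(count)) for _ in range(trials)]
    return statistics.pstdev(sums) / statistics.mean(sums)

def predicted_relative_half_width(m, mu=1.5, sigma=1.0 / math.sqrt(12.0)):
    """(2**(m/2) * sigma) / (2**m * mu) for the assumed uniform(1, 2) source."""
    return sigma / (mu * 2.0 ** (m / 2.0))

if __name__ == "__main__":
    for m in range(0, 7, 2):
        print(m, relative_half_width(m), predicted_relative_half_width(m))
```

Each increase of m by 2 should roughly halve the relative width, so the mantissas of the sums crowd into fewer and fewer of the n intervals.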


VI. RANGE OF EXPONENTS 


It is now clear that in order to examine carefully the effect of addition 
(and subtraction) on the reciprocal distribution, it is necessary to know 
the distribution of the exponents of the numbers to be combined. 
Unfortunately at this time about the only model we have is as follows. 
Assume a distribution of exponents. Under multiplication and division 
the exponents are added and subtracted (with, due to carries, an extra 1
occasionally added or subtracted) and by the central limit theorem
we can expect: (1) that the distribution of the exponents will approach
a normal distribution (assuming that overflow and underflow do not 
happen first) and (2) that this distribution will gradually spread out 
proportional to the square root of the number of operations. Thus, it 
appears that in practice the distribution of exponents is probably not 
stationary. Addition tends to eliminate the smaller exponents, while 
subtraction tends to increase them. 

Experience in numerical analysis shows that the range of the output 
numbers is usually much greater than the range of the input numbers, 
enough so to make one suspect that the variance increases as indicated 
in the above model. 

As one thinks carefully about the matter of addition and subtraction 
it seems reasonable to believe that they will not greatly perturb the 
reciprocal distribution; and the experimental data from "naturally
occurring numbers", which must have included some additions and
subtractions, seem to bear out this belief. 

The feeling that under repeated additions and subtractions the central 
limit theorem applies to numbers (which is true), and therefore con- 
tradicts the reciprocal distribution of the mantissas, is typical of the 
“fixed point arithmetic” viewpoint of numbers—we are representing the 
sums and differences as floating point numbers, and it is the distribution 
of these mantissas and their possible approach to the reciprocal distri- 
bution that is of relevance here. 


VII. APPLICATIONS OF THE RECIPROCAL DISTRIBUTION 


Besides accounting for the experimentally found distributions, the 
reciprocal distribution is relevant to many optimization situations. 

As a first example,’ consider the problem of placing the decimal 
(binary) point in the number representation system in order to minimize 
the number of normalization shifts after the computation of a product. 
(It was probably the minimization of normalizing shifts that caused 
IBM to adopt the base 16 in the system 360). If the point is placed 
before the first digit, then products of the form 


      0.xxx...
    × 0.xxx...
      --------
      0.0xx...


will require a shift to normalize the result; while if it is placed after the
first digit, then products like

      x.xxx...
    × x.xxx...
      --------
      xx.xx...


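will require a shift. Clearly these two cases have complementary probabilities.
For the reciprocal distributions the probability p of

\[ xy \le 1/b \]

is

\[
\begin{aligned}
p &= \int_{1/b}^{1} \frac{1}{x \ln b} \int_{1/b}^{1/bx} \frac{dy}{y \ln b} \, dx \\
  &= \int_{1/b}^{1} \frac{1}{x \ln^2 b} \left( \ln \frac{1}{bx} - \ln \frac{1}{b} \right) dx
   = \int_{1/b}^{1} \frac{1}{x \ln^2 b} \left( -\ln x \right) dx \\
  &= \frac{1}{\ln^2 b} \cdot \frac{\ln^2 b}{2} \\
  &= \frac{1}{2} .
\end{aligned}
\]

But for a flat distribution,

\[
\begin{aligned}
p &= \left( \frac{b}{b-1} \right)^{2} \int_{1/b}^{1} \int_{1/b}^{1/bx} dy \, dx
   = \left( \frac{b}{b-1} \right)^{2} \int_{1/b}^{1} \left( \frac{1}{bx} - \frac{1}{b} \right) dx \\
  &= \left( \frac{b}{b-1} \right)^{2} \frac{1}{b} \left[ \ln b - \left( 1 - \frac{1}{b} \right) \right] \\
  &= \frac{b \ln b - (b - 1)}{(b - 1)^2} .
\end{aligned}
\]

For b = 2 this is

\[ p = 2 \ln 2 - 1 \approx 0.386 . \]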
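
The two shift probabilities can be confirmed by direct numerical integration over the region xy ≤ 1/b. The sketch below is a rough check using a midpoint rule; the function names and the step count are assumptions made for the illustration, not anything specified in the paper.

```python
import math

def flat_density(x, base):
    """Flat density b/(b - 1) on [1/b, 1]."""
    return base / (base - 1.0)

def reciprocal_density(x, base):
    """Reciprocal density 1/(x ln b) on [1/b, 1]."""
    return 1.0 / (x * math.log(base))

def shift_probability_flat(base):
    """Closed form (b ln b - (b - 1)) / (b - 1)**2 for two flat mantissas."""
    return (base * math.log(base) - (base - 1.0)) / (base - 1.0) ** 2

def shift_probability_numeric(density, base, steps=400):
    """Midpoint-rule estimate of P(xy <= 1/b) for independent mantissas."""
    lo = 1.0 / base
    dx = (1.0 - lo) / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * dx
        upper = 1.0 / (base * x)          # y runs from 1/b up to 1/(bx)
        dy = (upper - lo) / steps
        inner = sum(density(lo + (j + 0.5) * dy, base) for j in range(steps)) * dy
        total += density(x, base) * inner * dx
    return total

if __name__ == "__main__":
    print(shift_probability_flat(2), shift_probability_numeric(flat_density, 2))
    print(shift_probability_numeric(reciprocal_density, 2))
```

For the reciprocal density the estimate stays near 1/2 for every base, whereas the flat value depends on b.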


As a second application, consider the estimation of the effect of the 
representation error of numbers in base 2 and base 16. In Ref. 7 
McKeeman reports that the maximum relative representation error 
(MRRE) and the average relative representation error (ARRE) are 
as shown in Table IV, where the average is over the reciprocal distri- 
bution. 

TABLE IV—MAXIMUM RELATIVE REPRESENTATION ERROR AND
AVERAGE RELATIVE REPRESENTATION ERROR

                  MRRE           ARRE
binary            1/2 2-8        0.18 X 2-37
octal             2-37           0.21 x 10-37
hexadecimal       Q-37           0.17 X 27-37


A third example is the application to roundoff propagation. If $x_1$ has
an error $\varepsilon_1$ and $x_2$ has error $\varepsilon_2$, then in the product

\[
\begin{array}{l}
x_1 + \varepsilon_1 \\
x_2 + \varepsilon_2 \\
\hline
x_1 x_2 + x_1 \varepsilon_2 + x_2 \varepsilon_1 + \varepsilon_1 \varepsilon_2
\end{array}
\]

it is the leading digits that control the estimate of the propagated error.
For the reciprocal distribution the mean is

\[ \mu = \int_{1/b}^{1} \frac{x \, dx}{x \ln b} = \frac{1 - 1/b}{\ln b} = \frac{b - 1}{b \ln b} . \]

For base 2, this is

\[ \mu = \frac{1}{2 \ln 2} = 0.72134 . \]

The second moment about the mean is

\[
M_2 = \int_{1/b}^{1} \frac{(x - \mu)^2}{x \ln b} \, dx
    = \frac{b - 1}{b \ln b} \left[ \frac{b + 1}{2b} - \frac{b - 1}{b \ln b} \right] ,
\]

which for b = 2 is

\[ M_2 = \frac{1}{2 \ln 2} \left( \frac{3}{4} - \frac{1}{2 \ln 2} \right) \approx 0.020674 . \]

For the flat distribution, $\mu$ = 0.75 and $M_2$ = 0.020833.

Thus we see that the effect of the reciprocal distribution on the average 
roundoff propagation is surprisingly small. 

Another example in which the reciprocal distribution must be con- 
sidered is that of producing “random” floating point mantissas. To 
generate these mantissas we use the earlier result that a long sequence 
of multiplications of numbers from a flat distribution will approximate 
a reciprocal distribution. Thus random mantissas can be generated by 


\[ y_{i+1} = y_i \, r_i \qquad \text{(shifted)} \]


where $r_i$ is from the usual (flat) random number generator and "shifted"
means after each product the leading zeros are shifted off. How well does
this work? Experimental verification* is given by 8192 trials, counting
the number of mantissas falling in each of N categories (see Table V).

The last two columns of Table V give the sign changes observed in 
the difference between the observed and theoretical reciprocal distri- 
bution. The expected number of sign changes might be expected to be 
(N - 1)/2, but since for N = 2 it is clear that one sign change will
occur (because the mean of the residuals is zero) we have used N/2 as 
the expected number. The chi-square test shows that the two distribu- 
tions are close; the sign change test shows that the residuals are not 
systematically distributed. From these tests, we see that the generator 
"works." It is interesting to note that the period of this generator may
well be much longer than that of the underlying flat random number 
generator. 
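
A generator of this kind takes only a few lines. The following is a minimal sketch, not the program used for the trials reported in Table V: the base, the starting value, the burn-in length, the seed, and the choice of categories that are equally likely under the reciprocal distribution are all assumptions made for the illustration.

```python
import math
import random

def random_mantissas(count, base=10, burn_in=20, seed=12345):
    """Generate mantissas in [1/base, 1) by y <- shift(y * r), r flat on (0, 1)."""
    rng = random.Random(seed)      # stands in for the usual flat generator
    y = 0.5                        # arbitrary starting mantissa (an assumption)
    result = []
    for step in range(burn_in + count):
        r = rng.random()
        while r == 0.0:            # avoid a zero product
            r = rng.random()
        y *= r
        while y < 1.0 / base:      # shift off the leading zeros
            y *= base
        if step >= burn_in:
            result.append(y)
    return result

def chi_square(mantissas, categories, base=10):
    """Chi-square statistic against the reciprocal distribution."""
    counts = [0] * categories
    for y in mantissas:
        u = math.log(y, base) + 1.0   # uniform on [0, 1) under the reciprocal law
        index = min(categories - 1, max(0, int(u * categories)))
        counts[index] += 1
    expected = len(mantissas) / categories
    return sum((c - expected) ** 2 / expected for c in counts)

if __name__ == "__main__":
    sample = random_mantissas(8192)
    for n in (64, 32, 16, 8, 4, 2):
        print(n, round(chi_square(sample, n), 3))
```

The statistic has N - 1 degrees of freedom, so values near N - 1 indicate agreement with the reciprocal distribution.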

It is easy to see as a general rule that when we try to optimize a 
library routine for minimum mean running time (as against the 
Chebyshev minimax run time) we need to consider the distribution of 
the input data. Hence floating point numerical routines need to consider 
the reciprocal distribution; the square root, log, exponential, and sine
are all examples. In the case of the exponential and sine, some study of
the exponents is also necessary.


* Thanks to Brian Kernighan.


TABLE V—DISTRIBUTION OF 8192 RANDOM MANTISSAS

                                         Residuals
                    Degrees of       Sign
  N       χ²         Freedom        Changes     Expected
 64     61.392         63             30           32
 32     22.804         31             14           16
 16     11.150         15              8            8
  8      7.724          7              5            4
  4      3.261          3              2            2
  2      1.467          1              1            1
A Mathematical Study of a Model of


Magnetic Domain Interactions 


By R. L. GRAHAM 
(Manuscript received March 18, 1970) 


In this paper, we initiate a study into the combinatorial aspects of a 
model of the interactions between discrete magnetic domains and their 
potential use in information processing devices. Starting with a simple 
model suggested by W. Shockley, we demonstrate certain (surprising) 
capabilities as well as inherent limitations upon the possible applications 
of the interactions described by this model. It should be noted that this simple 
model does not take into account all of the possible interactions between 
magnetic domains. 


I. INTRODUCTION 


The subject of discrete magnetic domains in certain orthoferrite 
materials has been under active investigation during the past several 
years, both from a theoretical physical viewpoint as well as that of the 
device-oriented physicist (for example, see Refs. 1-6). Considerable 
progress has resulted from these efforts, although needless to say, the 
end is certainly not in sight. Particular attention has been directed 
toward the problem of applying this new technology to the very im- 
portant area of information processing devices, an area in which it 
seems to have natural and significant applications.’” It is our intention 
in this paper to examine certain mathematical aspects of these applica- 
tions for a simple model of magnetic domain interactions suggested by 
W. Shockley. 


II. DESCRIPTION OF THE MODEL 


We shall begin by giving a very brief description of the physical 
situation and its translation into the mathematical model under con- 
sideration. The reader whose interests motivate him to seek a more 
technical explanation is referred to Refs. 6 or 8. 

Roughly speaking, thin platelets of certain orthoferrite materials 
possess the property that under suitable (magnetic) conditions, small
(~3 mils) discrete cylindrical magnetic domains, hereafter called
"bubbles", may be stably supported. Moreover, these bubbles may be
manipulated by the application of external magnetic fields as well as 
by their own mutual interaction (which in general causes two bubbles 
to repel one another). In a suitable physical environment, the location 
of a bubble in a piece of orthoferrite can be restricted to a finite set of 
possible positions within the material; these are ordinarily arranged in 
a rectangular array. It is possible to apply a local magnetic field to 
specific locations within the array with the following results: 


(i) If a bubble already occupies the position at which the field was
applied, then nothing happens.

(ii) If no bubble occupies the position at which the field was applied
and no bubble occupies any "nearby" position as well, then
(still) nothing happens.

(iii) If no bubble occupies the position at which the field was applied
but at least one bubble occupies some "nearby" position, then
some bubble at a nearby position will leave its original position
and now occupy the position selected by the field.


To eliminate the annoying indeterminacy in item (iii) it is possible
to apply "holding" fields to all but one of the "nearby" sites which has
the effect that only a bubble at the unheld position can move.

The mathematical model which will correspond to the preceding 
description will be phrased in the terminology of graph theory. The 
discrete positions at which bubbles may lie correspond to the set V of 
vertices of a graph G. Two sites which are "nearby" or "adjacent" to
one another (this is assumed to be a symmetric relation) correspond to
two vertices of G which are joined by an edge of G. Suppose bubbles are
located at (the sites corresponding to) the subset X of vertices V. We
define a command to be a directed edge $e = (v_1, v_2)$ with $v_1$ and $v_2$ adjacent
vertices of G. The command e transforms the locations of the bubbles
from X to $X^e$ where

\[
X^e =
\begin{cases}
(X - \{v_1\}) \cup \{v_2\} & \text{if } v_1 \in X, \; v_2 \notin X; \\
X & \text{otherwise.}
\end{cases}
\]

In other words, if there is a bubble at $v_1$ but no bubble at $v_2$ and the
command $e = (v_1, v_2)$ is applied to X, then the bubble at $v_1$ is moved
to $v_2$. Otherwise, the command e has no effect on X. A program is
defined to be a sequence $P = (e_1, e_2, \ldots, e_r)$ of commands $e_i$. In
general, a program P maps the set $2^V$ of all subsets of V into itself
by $X^P = (\cdots((X^{e_1})^{e_2})\cdots)^{e_r}$. It is the purpose of this paper to investigate
the mathematical properties of these maps.


† Of course, "careless" application of a magnetic field to an orthoferrite with
bubbles can annihilate bubbles, create bubbles, split bubbles in two, deform bubbles
into strips, and so on; but these pathological (though certainly useful) operations
will not be considered in our model.


III. SOME BASIC PROPERTIES OF PROGRAMS 


We begin by making the assumption that G is the complete graph on 
n vertices, that is, all pairs of vertices of G are joined by an edge.† As
mentioned in the previous section, a program P is a sequence of directed
edges $(e_1, e_2, \ldots, e_r)$ and P acts on a subset X of the vertices V of G by

\[ X^P = (\cdots((X^{e_1})^{e_2})\cdots)^{e_r} \]

where for e = (v, v'),

\[
X^e =
\begin{cases}
(X - \{v\}) \cup \{v'\} & \text{if } v \in X, \; v' \notin X; \\
X & \text{otherwise.}
\end{cases}
\]

If $X \subseteq V$ then |X| denotes the cardinality of X. We note

Fact 1: For all $X \subseteq V$, and all programs P, $|X^P| = |X|$.

This follows immediately from the definition of $X^P$.
The first interesting result we state is due to W. Shockley who
called it the
Non-decreasing Overlap Theorem: (Shockley) For all $X, Y \subseteq V$ and all
programs P,

\[ |X^P \cap Y^P| \ge |X \cap Y| . \]
Proof: Assume for some $P = (e_1, \ldots, e_r)$ and subsets $X, Y \subseteq V$ we
have $|X^P \cap Y^P| < |X \cap Y|$. Since $X^P = (\cdots((X^{e_1})^{e_2})\cdots)^{e_r}$, there
must exist a least i such that

\[ |X^{P_{i+1}} \cap Y^{P_{i+1}}| < |X^{P_i} \cap Y^{P_i}| \]

where $P_i$ denotes the program $(e_1, \ldots, e_i)$. Thus, for $\bar{X} = X^{P_i}$,
$\bar{Y} = Y^{P_i}$ and $e = e_{i+1} = (a, b)$ we have

\[ |\bar{X}^e \cap \bar{Y}^e| < |\bar{X} \cap \bar{Y}| . \]

If $c \ne a$, $c \ne b$ then $c \in \bar{X} \cap \bar{Y}$ implies $c \in \bar{X}^e \cap \bar{Y}^e$. If either $a \in \bar{X} \cap \bar{Y}$
or $b \in \bar{X} \cap \bar{Y}$ but not both then $b \in \bar{X}^e \cap \bar{Y}^e$. If both $a \in \bar{X} \cap \bar{Y}$ and
$b \in \bar{X} \cap \bar{Y}$ then $a \in \bar{X}^e \cap \bar{Y}^e$ and $b \in \bar{X}^e \cap \bar{Y}^e$. Hence, in any case

\[ |\bar{X}^e \cap \bar{Y}^e| \ge |\bar{X} \cap \bar{Y}| \]

which is a contradiction. This proves the theorem.


† Nothing essential is lost by this simplifying assumption. The vertices and edges
of the present model should not be confused with any incidental physical vertices
or edges in a particular device. An edge of the model may be generated for example
by transferring bubbles from a storage zone to an interaction zone and then returning
the resultant to the storage zone.

Shockley noted that this result shows that there is no replicating 
program P*. By a replicating program, we mean the following: Starting
with two fixed sets of vertices V' and V'' with $V' \cap V'' = \emptyset$
and 1-to-1 map $\theta: V'' \to V'$, we require that for each $X \subseteq V$,

\[ X^{P^*} \cap V' = X \cap V' \quad \text{and} \quad \theta(X^{P^*} \cap V'') = X \cap V' . \]

In other words, P* does not disturb $X \cap V'$ and in V'', P* creates a
"copy" of $X \cap V'$.

To show this, suppose there were such a program P*. By choosing
two subsets X and X' differing in a single element of V', their images
$X^{P^*}$ and $X'^{P^*}$ must differ in two points, namely, one in V' and the
corresponding point (under $\theta$) in V''. This, however, contradicts the
non-decreasing overlap (NDO) theorem and therefore P* cannot exist. 

Another consequence of the NDO theorem is the nonexistence of a
program P* which performs binary addition in the following way.

Suppose V' denotes a set of $m \ge 1$ pairs of vertices of G, V'' denotes
another set of m pairs of vertices disjoint from V', and V''' denotes a
set of m + 1 pairs of vertices, disjoint from V' and V''. We can imagine
these sets arranged as shown in Fig. 1.

We can represent an integer M, $0 \le M < 2^m$, in the m pairs of V'
by letting the jth pair of V' denote the jth binary digit in the binary
expansion of M. This can be done, for example, by assuming that
for each pair $(u_k, v_k)$ either $u_k \in X$, $v_k \notin X$, which will correspond to
a 0, or $u_k \notin X$, $v_k \in X$, which will correspond to a 1. Thus, for m = 5
the configuration $\{v_1, u_2, u_3, v_4, v_5\}$ (Fig. 2) would denote the
integer $10011_{(2)} = 19$.

The addition program P* would operate by starting with V''' in some
fixed configuration (for example, all zeros) and with arbitrary integers
M', M'' loaded into V', V'', respectively, to form the initial state X;
after applying P* to X we should get the sum M' + M'' in V'''.

The reason that P* cannot exist as described is precisely that the
NDO theorem would be violated. For consider the two additions:
$0 + (2^m - 1) = 2^m - 1$ and $1 + (2^m - 1) = 2^m$. The initial configurations
differ in only two positions. The final configurations differ in at least
m + 1, however, since $2^m - 1 = 11\cdots1_{(2)}$ and $2^m = 100\cdots0_{(2)}$.

Fig. 1—Symbolic arrangement of vertex locations for addition.

Thus, by the NDO theorem we get a contradiction and our assertion 
is proved. 
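
The counting behind this contradiction can be made concrete. The sketch below is an illustration under stated assumptions: the register labels, the tuple encoding of vertices and the helper names are hypothetical, and the final overlap is only bounded from above (every bubble outside the sum register is generously assumed to coincide).

```python
def encode(value, pairs, register):
    """Occupied vertices when `value` is stored in `pairs` digit pairs.

    Vertex (register, j, 0) plays the role of u_j (digit 0) and
    (register, j, 1) the role of v_j (digit 1).
    """
    return frozenset((register, j, (value >> j) & 1) for j in range(pairs))

def initial_state(m, first, second):
    """Summands loaded into V' and V'', sum register V''' set to all zeros."""
    return (encode(first, m, "V'")
            | encode(second, m, "V''")
            | encode(0, m + 1, "V'''"))

def overlap_before_and_after(m):
    """Return (initial overlap, upper bound on final overlap) for the
    two additions 0 + (2^m - 1) and 1 + (2^m - 1)."""
    big = 2**m - 1
    x, y = initial_state(m, 0, big), initial_state(m, 1, big)
    initial_overlap = len(x & y)
    # Digits in which the two sums differ inside the sum register.
    differing = len(encode(big, m + 1, "V'''") - encode(big + 1, m + 1, "V'''"))
    # By Fact 1 the number of bubbles is preserved; at best all bubbles
    # other than the differing sum digits coincide.
    final_overlap_bound = len(x) - differing
    return initial_overlap, final_overlap_bound
```

For every $m \ge 1$ the first value is 3m and the second is 2m, so the overlap would have to decrease, which the NDO theorem forbids.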

We give another example of a program which does not exist. If
e = (a, b) is a command and $a, b \in X$ then $X^e = X$. In the case that
a and b are both in X, we say that there is interference as e acts on X.
(We can think of the bubble at b as interfering with the attempted
movement of the bubble at a to vertex b.) Similarly, if $P = (e_1, \ldots, e_n)$
we say that there is interference as P acts on X if for some i there is
interference as $e_i$ acts on $X^{(e_1, \ldots, e_{i-1})}$. We note
Fact 2: If P acts on X with no interference then

\[ X^P = \bigcup_{x \in X} \{x\}^P. \]


Proof: It is sufficient to establish this for the case P = e = (a, b).
In this case

\[ \{x\}^e = \begin{cases} \{b\}, & \text{if } x = a; \\ \{x\}, & \text{otherwise.} \end{cases} \]

Thus

\[ \bigcup_{x \in X} \{x\}^e = \begin{cases} X - \{a\} \cup \{b\}, & \text{if } a \in X; \\ X, & \text{otherwise.} \end{cases} \]

But by the hypothesis of no interference, we cannot have both a and
b in X. Thus

\[ X^e = \begin{cases} X - \{a\} \cup \{b\}, & \text{if } a \in X \\ X, & \text{otherwise} \end{cases} \;=\; \bigcup_{x \in X} \{x\}^e \]

and the fact is established.

Fig. 2—A typical configuration representing an integer.

Fact 3: For X = {a, b, c, z}, there does not exist a program P such that

\[ \{a, b\}^P = \{c, z\}, \quad \{b, c\}^P = \{a, z\}, \quad \{c, a\}^P = \{b, z\}. \]


Proof: Suppose such a P exists. If P acts on these sets with no
interference then we would have by Fact 2,

\[ \{c, z\} = \{a\}^P \cup \{b\}^P, \quad \{a, z\} = \{b\}^P \cup \{c\}^P, \quad \{b, z\} = \{c\}^P \cup \{a\}^P, \]

which is impossible since the union of the left-hand sides of the equations
cannot equal the union of the right-hand sides. Thus, if $P = (e_1, \ldots, e_n)$
we may assume that there is a least i, $1 \le i \le n$, with $P_{i-1} = (e_1, \ldots, e_{i-1})$
such that $e_i$ acts on at least one of the sets $\{a, b\}^{P_{i-1}}$, $\{b, c\}^{P_{i-1}}$,
$\{c, a\}^{P_{i-1}}$ with interference. To be specific, assume that it is the set
$\{a, b\}^{P_{i-1}}$, that is, $e_i = (\{a\}^{P_{i-1}}, \{b\}^{P_{i-1}})$, where each singleton is
identified with its element (the other two cases are similar). By Fact 2 we have

\[ \{a, b\}^{P_{i-1}} = \{a\}^{P_{i-1}} \cup \{b\}^{P_{i-1}}, \]
\[ \{b, c\}^{P_{i-1}} = \{b\}^{P_{i-1}} \cup \{c\}^{P_{i-1}}, \]
\[ \{c, a\}^{P_{i-1}} = \{c\}^{P_{i-1}} \cup \{a\}^{P_{i-1}}. \]

Therefore

\[ \{b, c\}^{P_i} = (\{b\}^{P_{i-1}} \cup \{c\}^{P_{i-1}})^{e_i} = \{b\}^{P_{i-1}} \cup \{c\}^{P_{i-1}} \]

and

\[ \{c, a\}^{P_i} = (\{c\}^{P_{i-1}} \cup \{a\}^{P_{i-1}})^{e_i} = \{c\}^{P_{i-1}} \cup \{b\}^{P_{i-1}}. \]

Hence,

\[ \{a, z\} = \{b, c\}^P = \{c, a\}^P = \{b, z\}, \]

which is a contradiction. This proves Fact 3.
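
Because only four vertices are involved, Fact 3 can also be confirmed by exhaustive search over every configuration the three sets can reach simultaneously. This is a sketch for illustration only; the breadth-first search and all names in it are assumptions made here, not the author's method.

```python
from collections import deque
from itertools import permutations

def step(X, e):
    """Apply a single command e = (v, w) to the set X."""
    v, w = e
    if v in X and w not in X:
        return (X - {v}) | {w}
    return X

def reachable_triples(vertices, start):
    """All triples of sets reachable from `start` when the same
    program is applied to each member of the triple."""
    commands = list(permutations(vertices, 2))
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for e in commands:
            nxt = tuple(step(X, e) for X in state)
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

START = (frozenset("ab"), frozenset("bc"), frozenset("ca"))
TARGET = (frozenset("cz"), frozenset("az"), frozenset("bz"))
REACHABLE = reachable_triples("abcz", START)
```

The target triple never appears in `REACHABLE`, in agreement with the proof.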

Note that the nonexistence of the program of Fact 3 does not follow
directly from Fact 1 or the NDO theorem. A similar argument can be
given to show that for X = {a, b, c, d, A, B, C, D, z} there is no program
P such that

\[ \{a, c\}^P = \{A, z\}, \quad \{a, d\}^P = \{B, z\}, \quad \{b, c\}^P = \{C, z\}, \quad \{b, d\}^P = \{D, z\}. \]


IV. THE 2-VALUED BOOLEAN FUNCTIONS 


Our attention will now be focussed on the positive aspects of the 
model. In particular we shall be concerned with the problem of repre- 
senting the Boolean functions of m variables with appropriate programs. 
The way in which a function is to be represented is as follows. Suppose 
m = 2 and consider the function $f: \{0,1\} \times \{0,1\} \to \{0,1\}$ given by


x    y    f(x, y)
0 0 0 
0 1 1 
1 0 1 
1 1 1 


If the values 1 and 0 are interpreted as ‘‘true’’ and ‘false’, respectively, 
then f is just the truth function of the familiar operation of alternation. 
V will be the set of six vertices $\{x_0, x_1, y_0, y_1, f_0, f_1\}$ which we indicate
in Fig. 3. It is not difficult to show that no generality is lost by assuming
there are no additional vertices. In fact, by using the pair of positions
$x_0$, $x_1$ in which to observe the result of the program, instead of providing
the separate positions $f_0$, $f_1$, it is true that if a Boolean function of
$m \ge 2$ variables can be represented by a program in this general way,
then it can be represented using just 2m vertices. The program P(f)
which represents f is required to have the property that

\[ \begin{aligned}
f_0 &\in \{x_0, y_0\}^{P(f)}, & f_1 &\notin \{x_0, y_0\}^{P(f)}, \\
f_0 &\notin \{x_0, y_1\}^{P(f)}, & f_1 &\in \{x_0, y_1\}^{P(f)}, \\
f_0 &\notin \{x_1, y_0\}^{P(f)}, & f_1 &\in \{x_1, y_0\}^{P(f)}, \\
f_0 &\notin \{x_1, y_1\}^{P(f)}, & f_1 &\in \{x_1, y_1\}^{P(f)}.
\end{aligned} \]

Fig. 3—Symbolic arrangement of vertex locations for computing Boolean functions
of two variables.

The correspondence between the indices of the vertices of V and the
values of the variables of f is immediate. In terms of bubbles, one may
think of the configurations shown in Fig. 4 as representing a 0 and 1
respectively (compare Fig. 2); P(f) is required to map each of the four
possible initial states of the $x_i$-pair and $y_i$-pair into the correct value
in the $f_i$-pair.

Fig. 4—Configurations which represent 0 and 1.

It is not difficult in this case to find an appropriate P(f); for example,
we can take

\[ P(f) = (x_0, y_0)(x_0, f_0)(x_1, y_1)(y_1, f_1). \]

This is easily checked, as shown in Table I.

TABLE I—CUMULATIVE EFFECT OF P(f)

(x, y)    initial set    $(x_0, y_0)$    $(x_0, f_0)$    $(x_1, y_1)$    $(y_1, f_1)$    f(x, y)
(0, 0)    $\{x_0, y_0\}$    $\{x_0, y_0\}$    $\{f_0, y_0\}$    $\{f_0, y_0\}$    $\{f_0, y_0\}$    0
(0, 1)    $\{x_0, y_1\}$    $\{y_1, y_0\}$    $\{y_1, y_0\}$    $\{y_1, y_0\}$    $\{f_1, y_0\}$    1
(1, 0)    $\{x_1, y_0\}$    $\{x_1, y_0\}$    $\{x_1, y_0\}$    $\{y_1, y_0\}$    $\{f_1, y_0\}$    1
(1, 1)    $\{x_1, y_1\}$    $\{x_1, y_1\}$    $\{x_1, y_1\}$    $\{x_1, y_1\}$    $\{x_1, f_1\}$    1

We can write the preceding result in the shorthand form

f               P(f)
(0, 1, 1, 1)    $(x_0, y_0)(x_0, f_0)(x_1, y_1)(y_1, f_1)$.

Note that if $\bar{f}$ is defined by $\bar{f}(x, y) = 1 - f(x, y)$, that is, $\bar{f}$ is the
complement of f, then we can take

\[ P(\bar{f}) = P(f)(x_0, x_1)(x_0, y_1)(x_0, y_0)(f_1, x_0)(f_0, f_1)(x_0, f_0) \]

as a program which represents $\bar{f}$ (we leave this to the reader to verify).
Table II, together with this remark about $\bar{f}$, shows that all of the
16 possible 2-valued Boolean functions of two variables can be represented
by programs.
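
The verification left to the reader can be carried out mechanically. In the following sketch the string vertex names, the helper functions and the convention of returning `None` when the output pair does not hold exactly one bubble are all assumptions introduced for illustration.

```python
from itertools import product

def run(X, P):
    """Apply the commands of program P to the vertex set X in order."""
    X = frozenset(X)
    for v, w in P:
        if v in X and w not in X:
            X = (X - {v}) | {w}
    return X

# The program for alternation (OR) given in the text.
OR_PROGRAM = [("x0", "y0"), ("x0", "f0"), ("x1", "y1"), ("y1", "f1")]

# Commands appended to P(f) to obtain a program for the complement.
COMPLEMENT_SUFFIX = [("x0", "x1"), ("x0", "y1"), ("x0", "y0"),
                     ("f1", "x0"), ("f0", "f1"), ("x0", "f0")]

def represented_function(P):
    """Truth table computed by P, or None if for some input the
    f-pair does not end with exactly one bubble."""
    table = {}
    for x, y in product((0, 1), repeat=2):
        final = run({f"x{x}", f"y{y}"}, P)
        has0, has1 = "f0" in final, "f1" in final
        if has0 == has1:
            return None
        table[(x, y)] = 1 if has1 else 0
    return table
```

The suffix works by first clearing $x_0$, then using it as a temporary position to exchange the contents of $f_0$ and $f_1$.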

A question which naturally arises at this point is whether all Boolean
functions of m variables can be represented by programs in this manner.
For m = 1, the answer is in the affirmative (the specific programs are
left to the reader to discover); for m = 2, we have given the required
16 programs; for m = 3, the answer is in the affirmative but the number
($2^{2^3} = 256$) of programs prohibits their listing here; for m = 4, the
answer is once again in the affirmative but the calculations necessary to
establish this are much too long to be exhibited (there are, after all,
$2^{2^4} = 65536$ functions to consider). The cases m = 3 and m = 4 were
established by J. H. Spencer.® 

TABLE II—PROGRAMS FOR BOOLEAN FUNCTIONS OF 2 VARIABLES

f               P(f)
(0, 0, 0, 0)    $(x_0, f_0)(x_1, f_0)$
(0, 0, 0, 1)    $(x_1, y_1)(x_1, f_1)(x_0, f_0)(y_0, f_0)$
(0, 0, 1, 0)    $(x_1, y_0)(x_1, f_1)(x_0, f_0)(y_1, f_0)$
(0, 0, 1, 1)    $(x_1, f_1)(x_0, f_0)$
(0, 1, 0, 0)    $(x_0, y_1)(x_0, f_1)(x_1, f_0)(y_0, f_0)$
(0, 1, 0, 1)    $(y_0, f_0)(y_1, f_1)$
(0, 1, 1, 0)    $(x_0, y_0)(x_0, f_0)(x_1, y_1)(y_1, y_0)(x_1, f_0)(y_1, f_1)$
(0, 1, 1, 1)    $(x_0, y_0)(x_0, f_0)(x_1, f_1)(y_1, f_1)$

One may note that since all Boolean functions of two variables can
be represented, then in particular the Sheffer stroke function given by

x    y    f(x, y)
0    0    1
0    1    0
1    0    0
1    1    0

can also be represented. It is well known that any Boolean function
of m variables can be generated by expressions containing just the 
variables and the stroke function.’® Hence, one is tempted to conclude 
that any Boolean function is representable by a program. The flaw in 
this line of reasoning is that in order to express a particular Boolean 
function in terms of the stroke function, many occurrences of the stroke 
function and the variables are usually required. This in turn requires 
many ‘‘copies” of the variables to be available to the program in order 
to represent f. But we initially have only one pair of positions which 
indicates the value of any particular variable and by the NDO theorem 
we have seen that there cannot exist a “replication” program which 
would form extra copies of the values of the variables. Hence, within 
this model, we cannot use this technique to generate all the Boolean 
functions. It is certainly true however that if the model were extended 
to include bubble interactions which would allow replication of con- 
figurations (and such are known to exist physically), then all Boolean 
functions of m variables could be represented exactly in the manner 
described. 

These initial results create considerable optimism concerning the 
possibility of representing all the Boolean functions of m variables. 
Such hopes are shattered however by the result (which we later prove) 
that there exists a Boolean function of 11 variables which cannot be repre- 
sented by any program of this type. In fact, even though the fraction of 
the total number of Boolean functions of 11 variables which can be 
represented by programs can be shown to be $< 10^{-163}$, the author is
currently unable to exhibit any specific function which cannot be 
represented. Clearly, our understanding of this is less than complete. 
It is not unreasonable to hope that the representable functions could 
eventually be effectively characterized. 

We now restrict ourselves (without loss of generality) to representing
the Boolean functions of m variables in the following way. We shall
take $V = \{x_1, x_1', x_2, x_2', \ldots, x_m, x_m'\}$ to be a set of 2m vertices which
we imagine to be arranged in pairs as illustrated in Fig. 5. As before, a
bubble in the $x_i$ ($x_i'$) location of the pair $(x_i, x_i')$ will denote that the ith
variable of the function f has the value 0 (1). The way in which a program
P(f) represents f is as follows. Choose a distinguished vertex $a \in V$. There
is an obvious 1-1 correspondence between $\{0, 1\}^m$ and the class $\mathcal{C}$ of
all subsets $X \subseteq V$ such that X intersects each $\{x_i, x_i'\}$ in exactly one
element, given by

\[ \mathbf{a} = (a_1, \ldots, a_m) \leftrightarrow \{y_i \in V : y_i = x_i \text{ if } a_i = 0,\ y_i = x_i' \text{ if } a_i = 1,\ 1 \le i \le m\}. \]

Fig. 5—Symbolic arrangement of vertex locations for computing Boolean functions
of m variables.

Let $A_i \subseteq \{0, 1\}^m$, i = 0, 1, be the set of all $\mathbf{a} \in \{0, 1\}^m$ such that $f(\mathbf{a}) = i$,
and let $C_i$ be the corresponding subsets of $\mathcal{C}$. Our object is to find a
program P(f) which distinguishes between the sets $C_0$ and $C_1$. (Note
that $C_0 \cup C_1 = \mathcal{C}$.) Specifically we shall say that P(f) represents f if

\[ a \in X^{P(f)} \text{ for all } X \in C_0, \qquad a \notin X^{P(f)} \text{ for all } X \in C_1. \]
Let C denote the class of all subsets $X \subseteq V$ with $|X| = m$ and
for x and y distinct elements of V, let C(x) be the set of elements of C
which contain x, with C(y) defined similarly.† Consider the effect of
the command (x, y) on the members of C(x) and C(y). There are four
cases:

(i) $X \in C(x)$, $X \in C(y)$.
Then $X^{(x,y)} = X$ and $X^{(x,y)} \in C(x)$, $X^{(x,y)} \in C(y)$.
(ii) $X \in C(x)$, $X \notin C(y)$.
Then $X^{(x,y)} = X - \{x\} \cup \{y\}$ and $X^{(x,y)} \notin C(x)$, $X^{(x,y)} \in C(y)$.
(iii) $X \notin C(x)$, $X \in C(y)$.
Then $X^{(x,y)} = X$ and $X^{(x,y)} \notin C(x)$, $X^{(x,y)} \in C(y)$.
(iv) $X \notin C(x)$, $X \notin C(y)$.
Then $X^{(x,y)} = X$ and $X^{(x,y)} \notin C(x)$, $X^{(x,y)} \notin C(y)$.

Hence, after the application of (x, y) to all the sets in C, the new sets
C'(x), C'(y) (which now consist of all the subsets in C which contain x
and y respectively) are related to C(x) and C(y) by

\[ C'(x) = C(x) \cap C(y), \]
\[ C'(y) = C(x) \cup C(y). \]

Stated in these terms, the object of the program P(f) is finally to have
$C'(a) \cap \mathcal{C} = C_0$ after it has been applied to all the sets in C.

We give an example which illustrates these concepts. Let f be the 
Boolean function of three variables defined by: 


x    y    z    f(x, y, z)
0    0    0    0
0    0    1    1
0    1    0    0
0    1    1    0
1    0    0    0
1    0    1    1
1    1    0    1
1    1    1    1

† This approach was first suggested by J. H. Spencer.


$V = \{x_1, x_1', x_2, x_2', x_3, x_3'\}$ and we take $a = x_1'$.

\[ C_0 = \{\{x_1, x_2, x_3\}, \{x_1, x_2', x_3\}, \{x_1, x_2', x_3'\}, \{x_1', x_2, x_3\}\}, \]

\[ C_1 = \{\{x_1, x_2, x_3'\}, \{x_1', x_2, x_3'\}, \{x_1', x_2', x_3\}, \{x_1', x_2', x_3'\}\}. \]

A program P(f) which achieves the separation is

\[ P(f) = (x_1', x_2)(x_1', x_3)(x_2', x_3)(x_1, x_3)(x_1, x_1'). \]

That is,

\[ X \in C_0 \Rightarrow a \in X^{P(f)}, \qquad X \in C_1 \Rightarrow a \notin X^{P(f)}. \]

If C(x) denotes the initial subset of C consisting of all the sets in C which
contain x then we may conveniently record the sequential changes
which occur in each current C(x) in terms of the original C(y)'s as
the successive commands of P(f) are applied, as shown in Table III.
A little computation shows that the final set in the $x_1'$-row, the final
$C(x_1')$, when intersected with $\mathcal{C}$ gives exactly

\[ \{\{x_1, x_2, x_3\}, \{x_1, x_2', x_3\}, \{x_1, x_2', x_3'\}, \{x_1', x_2, x_3\}\}, \]

which equals $C_0$ as required.
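
The "little computation" can be replaced by direct simulation of the program on the eight admissible initial sets. This is a sketch under stated assumptions: the string names for the vertices (a trailing apostrophe marks the primed member of each pair) and the helper functions are introduced here for illustration only.

```python
from itertools import product

def run(X, P):
    """Apply the commands of program P to the vertex set X in order."""
    X = frozenset(X)
    for v, w in P:
        if v in X and w not in X:
            X = (X - {v}) | {w}
    return X

# Program and distinguished vertex from the three-variable example.
P_F = [("x1'", "x2"), ("x1'", "x3"), ("x2'", "x3"), ("x1", "x3"), ("x1", "x1'")]
A = "x1'"

# Truth table of the example function f(x, y, z).
F = {(0, 0, 0): 0, (0, 0, 1): 1, (0, 1, 0): 0, (0, 1, 1): 0,
     (1, 0, 0): 0, (1, 0, 1): 1, (1, 1, 0): 1, (1, 1, 1): 1}

def initial_set(bits):
    """Set with x_i present when bit i is 0 and x_i' present when it is 1."""
    return frozenset(f"x{i}" + ("'" if b else "") for i, b in enumerate(bits, start=1))

def separates(P, a, f):
    """True if a ends up occupied exactly for the inputs on which f is 0."""
    return all((a in run(initial_set(bits), P)) == (f[bits] == 0)
               for bits in product((0, 1), repeat=3))
```

With the convention that a bubble at the distinguished vertex signals the value 0, `separates(P_F, A, F)` holds for this program.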

In general the problem of representing Boolean functions reduces to
the following problem. We start with the 2m classes $C^{(0)}(y) = C(y) \cap \mathcal{C}$,
$y \in V$. We are then allowed to replace two of the classes C(y) and
C(y') by two (possibly) new classes $C(y) \cap C(y')$ and $C(y) \cup C(y')$.
We can repeat this operation as many times as desired with
any pair of classes currently in the list. Our objective is to eventually
generate a specified subset C* of $\mathcal{C}$.

We have already mentioned that for m = 1, 2, 3 and 4 it is possible
to generate any subset of $\mathcal{C}$ in this manner. We proceed to show that
for m = 11, there is a subset of $\mathcal{C}$ which cannot be generated. We first
need several preliminary observations.

TABLE III—CUMULATIVE EFFECT OF P(f)

          initial       $(x_1', x_2)$                $(x_1', x_3)$
$x_1$:    $C(x_1)$      $C(x_1)$                     $C(x_1)$
$x_1'$:   $C(x_1')$     $C(x_1') \cap C(x_2)$        $C(x_1') \cap C(x_2) \cap C(x_3)$
$x_2$:    $C(x_2)$      $C(x_1') \cup C(x_2)$        $C(x_1') \cup C(x_2)$
$x_2'$:   $C(x_2')$     $C(x_2')$                    $C(x_2')$
$x_3$:    $C(x_3)$      $C(x_3)$                     $(C(x_1') \cap C(x_2)) \cup C(x_3)$
$x_3'$:   $C(x_3')$     $C(x_3')$                    $C(x_3')$

          $(x_2', x_3)$                                            $(x_1, x_3)$
$x_1$:    $C(x_1)$                                                 $C(x_1) \cap (C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3))$
$x_1'$:   $C(x_1') \cap C(x_2) \cap C(x_3)$                        $C(x_1') \cap C(x_2) \cap C(x_3)$
$x_2$:    $C(x_1') \cup C(x_2)$                                    $C(x_1') \cup C(x_2)$
$x_2'$:   $C(x_2') \cap ((C(x_1') \cap C(x_2)) \cup C(x_3))$       $C(x_2') \cap ((C(x_1') \cap C(x_2)) \cup C(x_3))$
$x_3$:    $C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3)$         $C(x_1) \cup C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3)$
$x_3'$:   $C(x_3')$                                                $C(x_3')$

          $(x_1, x_1')$
$x_1$:    $C(x_1) \cap (C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3)) \cap C(x_1') \cap C(x_2) \cap C(x_3)$
$x_1'$:   $(C(x_1) \cap (C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3))) \cup (C(x_1') \cap C(x_2) \cap C(x_3))$
$x_2$:    $C(x_1') \cup C(x_2)$
$x_2'$:   $C(x_2') \cap ((C(x_1') \cap C(x_2)) \cup C(x_3))$
$x_3$:    $C(x_1) \cup C(x_2') \cup (C(x_1') \cap C(x_2)) \cup C(x_3)$
$x_3'$:   $C(x_3')$

To begin with, for $a, b \in V$, let A and B denote the current sets $C^{(i)}(a)$
and $C^{(i)}(b)$, respectively, after the ith command of the program P has
been executed. In other words, at this point in time $C^{(i)}(a)$ is the class
of all the original subsets of C which now contain a. For example, if
$a = x_2'$ in the preceding example, then after the fifth (and final) command
of P(f), $C^{(5)}(x_2')$ is $C^{(0)}(x_2') \cap ((C^{(0)}(x_1') \cap C^{(0)}(x_2)) \cup C^{(0)}(x_3))$. It is
immediate that if $C^{(i)}(a) \subseteq C^{(i)}(b)$ then the application of the command
(a, b) as the (i + 1)-st command of the program changes nothing. Hence
we can assume that we only use commands (a, b) for which at the time of
their application $C^{(i)}(a) \not\subseteq C^{(i)}(b) \not\subseteq C^{(i)}(a)$ (we say that $C^{(i)}(a)$ and
$C^{(i)}(b)$ are incomparable).

Initially all the starting classes C(x), $x \in V$, are mutually incomparable.
In general suppose we have a family of classes $D = \{A_i : 1 \le i \le t\}$,
$A_i \subseteq C$, with exactly r of the $\binom{t}{2}$ pairs of $A_i$ being comparable
and assume $A_1$ and $A_2$ are incomparable. Consider the family
$D' = D - \{A_1\} - \{A_2\} \cup \{A_1 \cap A_2\} \cup \{A_1 \cup A_2\}$. We wish to determine
how many pairs of the classes of D' are comparable. By definition
$D' = \{A_1 \cap A_2, A_1 \cup A_2, A_3, A_4, \ldots, A_t\}$. Of course for $i, j \ge 3$,
the comparability between the classes $A_i$ and $A_j$ remains unchanged.
There are several cases:

(i) $A_i \supseteq A_1$, $A_i \supseteq A_2$.
Then $A_i \supseteq A_1 \cup A_2$, $A_i \supseteq A_1 \cap A_2$.
(ii) $A_i \supseteq A_1$, $A_i \not\supseteq A_2$.
Then $A_i \supseteq A_1 \cap A_2$.
(iii) $A_i \not\supseteq A_1$, $A_i \supseteq A_2$.
Then $A_i \supseteq A_1 \cap A_2$.
(iv) $A_i \subseteq A_1$, $A_i \subseteq A_2$.
Then $A_i \subseteq A_1 \cap A_2$, $A_i \subseteq A_1 \cup A_2$.
(v) $A_i \subseteq A_1$, $A_i \not\subseteq A_2$.
Then $A_i \subseteq A_1 \cup A_2$.
(vi) $A_i \not\subseteq A_1$, $A_i \subseteq A_2$.
Then $A_i \subseteq A_1 \cup A_2$.

Finally, we have a most important new comparability in D', namely
$A_1 \cap A_2 \subseteq A_1 \cup A_2$. Thus, at least r + 1 pairs of classes of D' are
comparable. An immediate consequence of this observation is

Fact 4: We can assume that no program P(f) consists of more than $\binom{2m}{2}$
commands.

Proof: Since after i (nontrivial) commands of a program P(f) have
been applied, we must have (by induction) at least i pairs of the classes
$C^{(i)}(x)$, $x \in V$, being comparable and since there are just 2m classes and
therefore $\binom{2m}{2}$ pairs of classes, then P(f) must have $\le \binom{2m}{2}$ commands.

Theorem. There exists a Boolean function of 11 variables which cannot
be represented by a program.

Proof: It is sufficient to show that for m = 11, there is a subset C*
of $\mathcal{C}$ which cannot be generated by starting with the 2m classes C(x),
$x \in V$, and recursively applying the transformation $A, B \to A \cap B$,
$A \cup B$. Consider a typical program $P = (e_1, e_2, \ldots, e_t)$ and the
corresponding expressions $C^{(i)}(x)$, presented in Table IV.

TABLE IV—CUMULATIVE EFFECT OF P

P:                          $e_1$             $e_2$             ...    $e_i$             ...    $e_t$
$x_1$:    $C^{(0)}(x_1)$    $C^{(1)}(x_1)$    $C^{(2)}(x_1)$    ...    $C^{(i)}(x_1)$    ...    $C^{(t)}(x_1)$
$x_1'$:   $C^{(0)}(x_1')$   $C^{(1)}(x_1')$   $C^{(2)}(x_1')$   ...    $C^{(i)}(x_1')$   ...    $C^{(t)}(x_1')$
...
$x_m$:    $C^{(0)}(x_m)$    $C^{(1)}(x_m)$    $C^{(2)}(x_m)$    ...    $C^{(i)}(x_m)$    ...    $C^{(t)}(x_m)$
$x_m'$:   $C^{(0)}(x_m')$   $C^{(1)}(x_m')$   $C^{(2)}(x_m')$   ...    $C^{(i)}(x_m')$   ...    $C^{(t)}(x_m')$

In choosing the ith command $e_i$ of P there are at most $\binom{2m}{2} - i + 1$
possibilities for $e_i$ since after $(e_1, \ldots, e_{i-1})$ has been applied, at least i − 1
of the pairs $C^{(i-1)}(x)$, $C^{(i-1)}(y)$ are comparable and thus neither (x, y)
nor (y, x) can be the next command $e_i$. Therefore there are at most

\[ \prod_{i=1}^{t} \left[ \binom{2m}{2} - i + 1 \right] \le [m(2m - 1)]! \]

choices for the sequence of $e_i$, since $t \le \binom{2m}{2} = m(2m - 1)$ by Fact 4.
Also, for $i \ge 1$, each column $C^{(i)}(x)$, $x \in V$, contains at most two new
classes which did not occur in the preceding column since only two
classes are changed at each step. Hence there are at most

\[ [m(2m - 1)]! \binom{2m}{2} + 2m \]

classes which can be generated by these rules where the additional
term +2m comes from the 2m initial sets C(x), $x \in V$. On the other
hand, since $\mathcal{C}$ contains $2^m$ sets $X \subseteq V$, then there are $2^{2^m}$ subsets of $\mathcal{C}$
which we must try to generate. We are doomed to failure however since

\[ \left\{ [m(2m - 1)]! \binom{2m}{2} + 2m \right\} \Big/ 2^{2^m} \to 0 \]

as $m \to \infty$. We list these expressions for several small values of m in
Table V. Thus, not only are we guaranteed a single Boolean function 
of 11 variables which cannot be represented by a program, but in fact 
we have at least $10^{616}$ of them. It seems quite likely that there exist
Boolean functions of five variables which cannot be represented. 
However, at present, no specific example of a Boolean function is known 
which cannot be represented by a program. 
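
The two quantities being compared are easy to evaluate exactly with arbitrary-precision integers. The sketch below reads the bound as $[m(2m-1)]! \binom{2m}{2} + 2m$, a reading which reproduces the Table V entries for m = 2 and m = 3; the function names are assumptions made for illustration.

```python
from math import comb, factorial

def generated_bound(m):
    """Upper bound on the number of classes that can be generated:
    [m(2m-1)]! * C(2m, 2) + 2m."""
    n_pairs = comb(2 * m, 2)  # equals m(2m - 1)
    return factorial(n_pairs) * n_pairs + 2 * m

def total_functions(m):
    """Number of Boolean functions of m variables, 2^(2^m)."""
    return 2 ** (2 ** m)

def bound_is_decisive(m):
    """True when the bound falls below the number of functions, so that
    some function of m variables cannot be represented."""
    return generated_bound(m) < total_functions(m)
```

Under this reading the bound is far too large to exclude anything for small m, and m = 11 is the first value in the table at which it drops below $2^{2^m}$.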


TABLE V—BOUNDS ON THE NUMBER OF BOOLEAN FUNCTIONS WHICH
CAN BE GENERATED

m     $[m(2m-1)]! \binom{2m}{2} + 2m$     $2^{2^m}$
2     4324                                16
3     19615115520006                      256
10    $> 10^{355}$                        $< 10^{309}$
11    $< 10^{453}$                        $> 10^{616}$

V. SOME REMARKS


A number of partial results are known concerning the preceding 
problems which we shall only mention briefly here. 

The generation of Boolean functions as described has the following
very natural geometrical interpretation. For a fixed integer n, consider
the set of the $2^n$ vertices of an n-dimensional cube $C^n$ and let $A_1, \ldots, A_{2n}$
represent the 2n sets of $2^{n-1}$ vertices which each lie on one
(n − 1)-dimensional “face”. In other words, if the vertices of $C^n$ are labelled by
binary n-tuples in the usual way, then each $A_k$ corresponds to a set of
$2^{n-1}$ n-tuples in which some component is constant. As before, we are
allowed to replace any two sets A and B in the class of 2n sets by $A \cap B$
and $A \cup B$. We can repeat this transformation as often as desired. The
question is: which subsets $X \subseteq C^n$ can be generated in this manner? We
have shown that there exists a set $X \subseteq C^{11}$ which cannot be so generated.

More generally, suppose we start with a class of n formal sets
$X_1, \ldots, X_n$ and ask which formal expressions in the $X_k$ can be generated
using the transformation $X, Y \to X \cap Y, X \cup Y$ iteratively. It can be
shown”? for example, that all the elementary symmetric functions (using 
$\cap$ and $\cup$ in place of the usual · and +) can be generated. Let us call
a well-formed expression E in the $X_k$'s symmetric in $X_i$ and $X_j$ if the
substitution $X_i \to X_j$, $X_j \to X_i$ leaves E unchanged. Thus we can
write E in the form

\[ E = ((X_i \cap X_j) \cap W_1) \cup ((X_i \cup X_j) \cap W_2) \cup W_3, \]

where the $W_k$ are well-formed (possibly empty) expressions in the $X_k$'s
not involving $X_i$ or $X_j$. We say that we collapse $X_i$ and $X_j$ in E if we
apply the transformation $X_i \cap X_j \to X_i$, $X_i \cup X_j \to X_j$, to form

\[ E' = (X_i \cap W_1) \cup (X_j \cap W_2) \cup W_3. \]

Certainly, if E can be generated using the transformations
$X, Y \to X \cap Y, X \cup Y$ starting from $X_1, \ldots, X_n$, then there is a sequence
of collapses starting with E and ending with some single variable $X_k$.
A basic theorem can be proved which asserts that if it is possible to
generate E, then no matter how we collapse symmetric variables
starting with the expression E we must reach some single variable $X_k$.
In other words, in attempting to collapse E to a single variable, we can
never make a “bad” move. Once the structure of the expressions
which can be generated is sufficiently well understood, perhaps the
representable subsets of $C^n$ can then be determined.

Another line of research suggested by this bubble model is in the
following direction. For binary sequences x and y, define d(x, y), the
(Hamming) distance between x and y, to be the number of positions in
which the sequences x and y differ. The fact which prevented the
existence of a program which could add two integers expressed to the
base 2 was the fact that there are pairs of additions in which the binary
expansions of the corresponding summands are close together (in the
metric d) but whose sums are not close, thus conflicting with the NDO
theorem. What we would like is a mapping $m \to \tau(m)$ of integers into
binary sequences for which we have

\[ d(\tau(m), \tau(m')) + d(\tau(n), \tau(n')) \ge d(\tau(m + n), \tau(m' + n')). \]

With only this constraint there are trivial solutions, for example,

\[ m \to \underbrace{11 \cdots 1}_{m}. \]

With this mapping we are essentially expressing m to the base 1 (well-
known by many cultures to be inefficient for representing large numbers,
say, those exceeding 10). Hence, we might require in addition that the
number of binary sequences of length t which are in the range of the
mapping $\tau$ be at least $a^t$ for some fixed a > 1. Is it possible to find
a suitable $\tau$ for which an addition program is possible in this model of
bubble interactions?
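
The distance condition, read as requiring that close summands give close sums, can be tested for candidate mappings by brute force. In the sketch below the fixed-length padding, the function names and the search limits are assumptions chosen for illustration; the check is over a finite range only.

```python
from itertools import product

def hamming(x, y):
    """Number of positions in which equal-length sequences x and y differ."""
    return sum(a != b for a, b in zip(x, y))

def tau_unary(m, length):
    """Base-1 representation: m ones padded with zeros to a fixed length."""
    return "1" * m + "0" * (length - m)

def tau_binary(m, length):
    """Ordinary binary expansion padded with leading zeros."""
    return format(m, f"0{length}b")

def satisfies_condition(tau, limit, length):
    """Check d(t(m),t(m')) + d(t(n),t(n')) >= d(t(m+n),t(m'+n'))
    for all 0 <= m, m', n, n' <= limit."""
    for m, mp, n, np_ in product(range(limit + 1), repeat=4):
        lhs = hamming(tau(m, length), tau(mp, length)) \
            + hamming(tau(n, length), tau(np_, length))
        rhs = hamming(tau(m + n, length), tau(mp + np_, length))
        if lhs < rhs:
            return False
    return True
```

The unary mapping passes because its distances reduce to $|m - m'|$, while ordinary binary fails already for the pair of additions 0 + 7 and 1 + 7.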

Finally, we have considered just one rather simple model in this
paper. Physically, many other bubble interactions are possible (although
some presently operate with significantly smaller margins than others)
and this of course would lead to other models. It would be very interesting
to understand the corresponding questions in some of these other
models.
Dielectric Guide with Curved Axis
and Truncated Parabolic Index 


By E. A. J. MARCATILI 
(Manuscript received May 5, 1970) 


We find the field configurations and the propagation constants of the
guided modes in a dielectric waveguide with curved axis and rectangular
cross-section. Outside the guide, the refractive index is uniform. Inside,
the index profile in the radial direction (intersection of the meridional plane
and the plane of curvature) follows a parabolic law with the maximum at
the center of the guide; in the direction perpendicular to the plane of
curvature the index is either uniform or parabolic, again with the maximum at
the center of the guide. The guide with mixed profiles has been proposed as
an easy-to-support, low-loss, ribbon-like guide for millimeter and optical
waves while the other, with parabolic profile in both directions, is similar to
the “SELFOC” or “GRIN” image transmitting guides.

The axial field components are small compared to the transverse
components and consequently the modes are almost of the TEM kind. Within
the guide the field distribution along a quadratic profile is a parabolic
cylinder function of order close to an integer, and is sinusoidal along the
uniform profile. The field components outside of the guide decay almost
with exponential law.

Inside the SELFOC-like guide, the field distribution of the fundamental
mode is gaussian and except for the attenuation the characteristics
of the beam are similar to those obtained for a guide in which the parabolic
index profile is not truncated.

The attenuation constant α of any mode is very sensitive to the radius of
curvature R. Doubling R reduces α by several orders of magnitude.

Fixing R and the difference of refractive index between the center of the
guide and the edge of it, the attenuation constant α passes through a
minimum for a guide width measured in the plane of curvature which is only a
few beam-widths.

Radiation loss for the fundamental gaussian mode is negligibly small if
the distance between the center of the beam and the edge of the guide is two
or more half beam-widths.

Guides with rectangular index profile in the plane of curvature have less
radiation loss than similar guides with truncated parabolic profile.
I. INTRODUCTION


A dielectric guide in which the refractive index decreases with para- 
bolic law away from its axis acts as a lens-like medium.’ The trans- 
mission through it is known even if the axis is not straight’ and if the 
parabolic decrease is different in two orthogonal directions* (astigmatic 
guide). 

Though extremely useful in many respects the parabolic medium is 
not realizable since it has ever-decreasing refractive index away from 
the axis and this in turn produces an untenable physical result. Thus 
though we know that in any realizable dielectric guide with curved 
axis, radiation losses are inevitable,’ the modes in the parabolic medium 
with curved axis can have no radiation loss since the refractive index 
tending towards infinity far away from the axis prevents it. 

A more realistic model is achieved by truncating the parabolic index 
distribution. We begin, in Section II, studying the two dimensional 
guide, Fig. 1a, in which the index profile, Fig. 1b, varies as a truncated 
parabolic function along the x axis and is independent of y while out- 
side of the guide the index is uniform. 

Later, this guide is modified in such a way that along y, the index 
profile is either rectangular, Fig. 2a, or another truncated parabolic 
function, Fig. 2b. 

The first of these guides has the index distribution of the dielectric 
thin-film guide proposed in Ref. 6 as a low-loss, easy-to-support ribbon- 
like guide for millimeter and optical waves. It has also the configuration 
of a possible guide for integrated optics.’ This guide, with curved axis 
has been analyzed in Ref. 8 ignoring radiation due to curvature. In 
Section II, both the phase and attenuation coefficients of the guided 
modes are evaluated and compared to those in a similar guide with 
rectangular index profiles along both x and y. 

The results obtained for the guide with truncated parabolic profiles 
along x and y, Fig. 2b, are applicable, at least in order of magnitude, 
to “SELFOC’” or “GRIN’’”® fibers, and tubular gas lenses’* with 
curved axes. 

Finally conclusions are drawn in Section III, while all the mathe- 
matics are given in the Appendix. 


II. MODES IN THE CURVED GUIDE 


Consider the two-dimensional curved guide in Fig. 1a. The parabolic 
refractive index within the guide is independent of y and equal to 


je nf = a(a a 22)" . (1)
Fig. 1—(a) Two-dimensional truncated parabolic guide; (b) Refractive index
profile; (c) Electric field distribution of the fundamental mode.


where a is the width of the guide, n the refractive index in the center
of it and n(1 − Δ) the refractive index at the edges. Outside the guide,
the index is





n, = n(l — A— A,). (2) 
We make the following assumptions:

\[ \Delta \ll 1, \qquad \Delta_1 \ll 1 \tag{3} \]

and
mr ae ee ra (4)
Fig. 2—(a) Inhomogeneous dielectric thin film guide; (b) “SELFOC” or
“GRIN” guides with rectangular cross-section.


where λ is the free-space wavelength and R the radius of curvature
of the guide. The physical significance of inequalities (3) is that the
guided modes will have phase velocities quite comparable to that of a
plane wave in a uniform medium of refractive index n. The inequality
(4) insures that the amplitudes of the field components at the edge of the
guide are small compared to their maxima within the guide. In other
words, most of the electromagnetic field is well confined within the
guide, Fig. 1c, and consequently the loss per wavelength is small compared
to unity. Considering only guided modes with field configurations
independent of y, we can group them in two families: TE and
TM. The field components of any mode of the first family are $E_y$,
$H_x$ and $H_z$ while those of the second are $H_y$, $E_x$ and $E_z$. In each family
the transverse components are far larger than the axial components
and consequently both families are essentially of the TEM kind.
The transverse components $E_y$, $H_x$, $H_y$ and $E_x$ of both families
have the same functional dependence within and without the guide.
Therefore we will talk from now on of the E field meaning either one of
those four components.

Within the guide, and subject to the conditions (3) and (4), the E
field distribution for the pth mode is essentially

EH = exp | He, 


in which the first two factors describe the field distribution along x,
and the last gives the propagating wave dependence along the curvilinear
z axis. Similarly to the field distribution in the lens-like medium
($a = \infty$), the first factor is a gaussian with its maximum located at a
distance

a a 
Ege oto 9 exp like — wf] 6) 


fod 














W W 


2 AO 
Pe  8AR 
from the center of the guide. The normalizing 1/e half-width is 


(6) 





= ar. 
oc Vara a 


The second factor in equation (5) is a Hermite polynomial of order p
which is also centered at $x = -(a/2) + \rho$ and the argument is normalized
to w/2. Strictly speaking the expression (5) should have, instead
of the Hermite polynomial, a Hermite function of order close to p.
Interested readers can find the details in the Appendix.

For the fundamental mode p = 0 the Hermite polynomial is unity
and the transverse field distribution is the well-known gaussian.

The propagation constant $k_z = \beta + i\alpha$ in equation (5) is complex
and the phase and attenuation constants calculated in equations (36)
and (37) are

a 5 inyaig Cn 
cae p.l 3 T T6ak? ~ whn KG + | (8) 
and 
a 2pt+l1 
a , 
_ Lu 604 pa Me (y 220)" F 
ay ae TTY exp { la ae (> + 5)(22) | 


a 
2w 





2 (1 = a} (9) 
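in which 

$$\beta_0 = kn\left[1 - \left(\frac{2}{knw}\right)^2\left(p + \frac{1}{2}\right)\right]^{1/2}, \qquad (10)$$

$$d = \frac{1}{\mathcal{R}}\left(\frac{a}{w}\right)^2, \qquad (11)$$

$$\mathcal{R} = \frac{4\pi n}{\lambda}(2\Delta)^{3/2} R, \qquad (12)$$

and the values of M and K can be found in equations (38) and (39). 
Let us discuss the physical meaning of some of these formulas. 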
Let us discuss the physical meaning of some of these formulas. 

The phase constant $\beta$ given in equation (8) is the 
phase constant $\beta_0$ (10) of the lens-like medium with straight axis ($R = 
a = \infty$), multiplied by a bracket essentially equal to one; the two small 
terms contained therein take into account the curvature of the axis 
and the truncation of the parabolic profile. 

More interesting is the attenuation constant (9). The value $\sqrt{2\Delta}\,R\alpha$, 
which is the normalized attenuation per radian, has been plotted in 
Fig. 3 for the fundamental mode p = 0 and $\Delta_p = 0$. The abscissa is 
the square of the guide width a normalized to the beam-width 2w or 
its equivalent $(\pi n a/\lambda)\sqrt{\Delta/2}$ which is the guide width normalized to 
the free wavelength. The parameter used for the solid curves is the 
normalized radius of curvature $\mathcal{R}$ (12). For a given radius of curvature 
the loss per radian is highly sensitive to the width of the guide and 
passes through a minimum at width 

& = (8) 

2w =\8/* 
For a wide range of values of $\mathcal{R}$, say 10 to 1000, that minimum loss 
occurs when the guide width is only a few beam-widths. 
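
The normalized quantities used in Fig. 3 can be tied together numerically. The sketch below assumes $\mathcal{R} = (4\pi n/\lambda)(2\Delta)^{3/2}R$ and $d = a/(4\Delta R)$, as read from equations (12) and (22), and checks that d also equals $(a/w)^2/\mathcal{R}$. Function names and sample values are illustrative.

```python
import math


def normalized_radius(n, delta, wavelength, R):
    """Normalized radius of curvature, assumed form (4*pi*n/lambda) * (2*delta)^(3/2) * R."""
    return 4.0 * math.pi * n * R / wavelength * (2.0 * delta) ** 1.5


def displacement_ratio(a, delta, R):
    """Ratio d = 2*rho/a between beam displacement and guide half-width, assumed a/(4*delta*R)."""
    return a / (4.0 * delta * R)


def width_over_beam_squared(n, delta, a, wavelength):
    """(a/w)^2 with w^2 = a*lambda / (pi*n*sqrt(8*delta)) (assumed form of w)."""
    return a * math.pi * n * math.sqrt(8.0 * delta) / wavelength


if __name__ == "__main__":
    n, delta, a, wavelength, R = 1.5, 0.01, 10.0, 0.5, 600.0  # lengths in microns
    r_norm = normalized_radius(n, delta, wavelength, R)
    d_direct = displacement_ratio(a, delta, R)
    d_from_ratio = width_over_beam_squared(n, delta, a, wavelength) / r_norm
    print(r_norm, d_direct, d_from_ratio)
```

Both routes to d agree algebraically, which is a useful consistency check when reading values off the solid and dotted curves.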

The dotted lines are curves of constant d, that is constant ratio 
$2\rho/a$ between the beam displacement from the guide axis $\rho$ and the 
guide half-width a/2. It is easy to understand the downward trend of 
these curves for large abscissas. Consider a guide with fixed geometry 
and decrease the wavelength $\lambda$ of operation. The beam remains at the 
same distance $\rho$ from the guide axis but it becomes narrower and conse- 
quently the field at the edge of the guide and the radiation loss de- 
crease. It is surprising that the minimum radiation loss of the solid 
curves occurs when the beam displacement is a small part of the guide 
width (d of the order of 0.1). 

Why do the solid lines have a minimum? For very narrow guides 
($a/2w \ll 1$), most of the electromagnetic field travels outside of the 
guide and any curvature of the axis introduces substantial radiation 
losses to this loosely guided beam. On the other hand, for very wide 
guides ($a/2w \gg 1$), any curvature of the axis displaces the beam close 
to one edge of the guide (d close to unity) and once again substantial 
losses occur. There must be a minimum in between. 

It is interesting to compare the losses in these guides of truncated 
parabolic index profile with guides of identical width but with rectangu- 
lar index profile of height $n\Delta$. In Fig. 4, the solid curves are a repetition 
of some of those in Fig. 3, while the dotted ones have been reproduced 
from Ref. 12. The abscissa is again $(a/2w)^2$ which is identical to $(\pi/4)a/A$ 
in which 

$$A = \frac{\lambda}{n\sqrt{8\Delta}}$$

is a dimension such that for a < A, the guide with rectangular index 
profile supports a single mode and for a > A, the guide is multimode. 

Fig. 4—Radiation loss in curved guides with truncated parabolic index profile 
(solid curves) and with rectangular index profile (dashed curves). Abscissa: 
$\pi a/4A = (a/2w)^2 = (\pi n a/\lambda)\sqrt{\Delta/2}$. 
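
The single-mode dimension A and the abscissa of Fig. 4 can be evaluated directly. The sketch below assumes $A = \lambda/(n\sqrt{8\Delta})$, as read from equation (24), and checks the stated identity $(a/2w)^2 = (\pi/4)a/A$ against the equivalent form $(\pi n a/\lambda)\sqrt{\Delta/2}$. Function names and sample values are illustrative.

```python
import math


def single_mode_width(n, delta, wavelength):
    """Dimension A below which the rectangular-profile guide is single mode.

    Assumed form: A = wavelength / (n * sqrt(8 * delta)).
    """
    return wavelength / (n * math.sqrt(8.0 * delta))


def abscissa(a, n, delta, wavelength):
    """Abscissa (a/2w)^2 written as (pi/4) * a / A."""
    return math.pi / 4.0 * a / single_mode_width(n, delta, wavelength)


def is_single_mode(a, n, delta, wavelength):
    """True when the rectangular-profile guide of width a supports a single mode."""
    return a < single_mode_width(n, delta, wavelength)


if __name__ == "__main__":
    n, delta, wavelength = 1.5, 0.01, 0.5  # wavelength in microns
    print(single_mode_width(n, delta, wavelength))
    print(abscissa(10.0, n, delta, wavelength))
    print(is_single_mode(10.0, n, delta, wavelength))
```

A guide ten microns wide at these values is well into the multimode range of the rectangular profile, which is the regime where the two profiles differ most in Fig. 4.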

For the same radius of curvature, guide width, and same $\Delta$ on axis, 
the guide with truncated parabolic profile has more loss than the guide 
with rectangular profile. The difference is very marked for large abscissas, 
but this result should not be surprising because in the case of curved 
guides with truncated parabolic profile the beam travels close to one 
edge of the guide where there is little difference of refractive index 
between the inside and outside, while in the case of rectangular profile, 
though most of the power travels also close to one edge of the guide, the 
full difference of refractive index $n\Delta$ is there to help in the guidance. 

In Fig. 5 we have plotted again the attenuation per radian as a 
function of $(a/2w)^2$, but this time we use as parameter the value of 

$$h = \frac{a/2 - \rho}{w}$$

which is the number of beam half-widths between the center of the 
beam and the external edge of the guide. The curves have asymptotes 
(dashed lines) parallel to both coordinates. 

For h = 2, $\Delta = 0.01$, the attenuation per radian $\alpha R$ turns out to be 
smaller than 0.003, which is very small for most purposes. 

If the truncated parabolic profile is on a pedestal ($\Delta_p \neq 0$), the 
losses are even smaller than those depicted in Fig. 4. The influence of 
$\Delta_p$ in the attenuation constant (9) appears in the bracket of the ex- 
ponent. The other two terms are in general small compared to unity. 
Therefore even a modest value of $\Delta_p$, say $\Delta_p = \Delta$, is enough to reduce 
the losses depicted in Figs. 3 and 5 by several orders of magnitude. 

What happens when $p \neq 0$? From equation (9) we find as expected 
that for a given guide the radiation loss increases fast with the order p 
of the mode. The highest order mode that travels only slightly in- 
fluenced by the guide width is characterized by 

$$p_{\max} = \left(\frac{a/2 - \rho}{w}\right)^2 - \frac{1}{2} = h^2 - \frac{1}{2}.$$

Naturally $p_{\max}$ is independent of $\Delta_p$, and when the beam center is close 
to a beam half-width from the edge, $p_{\max} = 0$. 

It is shown in the Appendix that if the refractive index profile along y, 
Fig. 1a, is not uniform but has either rectangular or truncated para- 
bolic shape, Figs. 2a and 2b, the guides have different phase constants 
than equation (8) but practically the same attenuation constant (9) 
provided that most of the electromagnetic field travels within the 
guide. Therefore everything said about attenuation in this section 
applies to the three guides. 

Fig. 5—Radiation loss in curved guides with truncated parabolic index profile. 

For the following examples we will only use Figs. 3, 4 and 5 since all 
the important results and formulas are there. 

2.1 Example A 

For a guide such that 

n = 1.5, 
$\Delta$ = 0.01, 
$\Delta_p$ = 0, 
a = 0.1 mm, 
$\lambda$ = 1 μ, 

what is the radius of curvature R for which the loss per radian is of 
the order of $10^{-3}$? 

We calculate the abscissa and ordinate of Fig. 5 to be 

$$\left(\frac{a}{2w}\right)^2 = \frac{\pi n a}{\lambda}\sqrt{\frac{\Delta}{2}} \approx 33$$

and 

$$\sqrt{2\Delta}\,\alpha R = 1.4 \cdot 10^{-4}.$$

The parameter h obtained from Fig. 5 is approximately 2 and we derive 

$$R = \frac{a^2}{8\Delta\rho} = \frac{a^2}{8\Delta(a/2 - hw)} \approx 3.8\ \text{mm}.$$

A very small radius indeed. 
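
The arithmetic of Example A can be retraced as a minimal sketch. It assumes the relations $w = [a\lambda/(\pi n\sqrt{8\Delta})]^{1/2}$, $h = (a/2 - \rho)/w$ and $\rho = a^2/(8\Delta R)$, a target loss per radian of $10^{-3}$ (the exponent is hard to read in the source), and the approximate reading h = 2 from Fig. 5. The function name `example_a` is illustrative.

```python
import math


def example_a(n=1.5, delta=0.01, a=100.0, wavelength=1.0,
              loss_per_radian=1e-3, h=2.0):
    """Return (abscissa, ordinate, R) for Example A; lengths in microns.

    h is the value read off Fig. 5 and is only approximate, so R is an estimate.
    """
    w = math.sqrt(a * wavelength / (math.pi * n * math.sqrt(8.0 * delta)))
    abscissa = (a / (2.0 * w)) ** 2                        # (a/2w)^2
    ordinate = math.sqrt(2.0 * delta) * loss_per_radian    # sqrt(2*delta) * alpha * R
    rho = a / 2.0 - h * w                                  # from h = (a/2 - rho)/w
    R = a ** 2 / (8.0 * delta * rho)                       # from rho = a^2/(8*delta*R)
    return abscissa, ordinate, R


if __name__ == "__main__":
    abscissa, ordinate, R = example_a()
    print(f"abscissa = {abscissa:.1f}, ordinate = {ordinate:.2e}, R = {R / 1000.0:.2f} mm")
```

Because h is read from a plot, the resulting radius should be taken as an order-of-magnitude figure of a few millimeters.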


2.2 Example B 

For integrated optics a guide with truncated-parabolic profile may 
have the following characteristics 

n = 1.5, 
$\Delta$ = 0.01, 
$\Delta_p$ = 0, 
a = 10 μ, 
$\lambda$ = 0.5 μ, 
R = 0.6 mm. 

What is the loss per radian? 

From Fig. 3 or 4 we get the abscissa and parameter 

$$\left(\frac{a}{2w}\right)^2 = \frac{\pi n a}{\lambda}\sqrt{\frac{\Delta}{2}} \approx 6.7,$$

$$\mathcal{R} = \frac{4\pi n R}{\lambda}(2\Delta)^{3/2} \approx 60.$$

Consequently the loss per radian results 

$$\alpha R = 0.018.$$

If instead of parabolic the index had been rectangular, from Fig. 4 
we deduce that the loss per radian would have been 0.00018, two 
orders of magnitude smaller. 
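
The inputs to the figure lookup in Example B can be checked with a short sketch. It assumes the abscissa $(\pi n a/\lambda)\sqrt{\Delta/2}$ and the normalized radius $(4\pi n R/\lambda)(2\Delta)^{3/2}$; the two loss values 0.018 and 0.00018 are the figure readings quoted in the text, not computed here. The function name `example_b` is illustrative.

```python
import math


def example_b(n=1.5, delta=0.01, a=10.0, wavelength=0.5, R=600.0):
    """Return (abscissa, normalized_radius) for Example B; lengths in microns."""
    abscissa = math.pi * n * a / wavelength * math.sqrt(delta / 2.0)
    normalized_radius = 4.0 * math.pi * n * R / wavelength * (2.0 * delta) ** 1.5
    return abscissa, normalized_radius


if __name__ == "__main__":
    abscissa, normalized_radius = example_b()
    print(f"(a/2w)^2 = {abscissa:.2f}, normalized radius = {normalized_radius:.1f}")
    # Ratio of the two losses quoted in the text (parabolic vs rectangular profile).
    print(f"loss ratio = {0.018 / 0.00018:.0f}")
```

The computed normalized radius comes out near 64, which the text rounds to about 60 for the purpose of selecting a curve.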


III. CONCLUSIONS 


For losses small enough, the field configurations and phase constants 
of the modes in dielectric guides, Figs. 2a and 2b, with curved axis 
and parabolic index profile on a pedestal, are quite comparable to 
those in a similar guide in which the parabolic profile is extended to 
infinity. 

The attenuation constant of a mode is very sensitive (exponential 
dependence) to the radius of curvature, size of the pedestal and order 
of the mode. The higher the order of the mode and the smaller the size 
of the pedestal the larger the loss. 

Quantitative results about the attenuation constant for the funda- 
mental gaussian mode in a guide without pedestal are given in Figs. 3, 
4 and 5 and in typical examples at the end of the preceding section. 
We find in these figures the loss per radian $\alpha R$ as a function of the guide 
width a, using as parameter the radius of curvature R, or the ratio 
between beam displacement $\rho$ and guide width, or the ratio between 
the beam distance from the edge of the guide, $a/2 - \rho$, and the beam 
width w. The main conclusions are: 


(i) Doubling R reduces the attenuation constant $\alpha$ several orders 
of magnitude. 

(ii) For any R, there is a guide width that minimizes the loss per 
radian. That dimension is only a few beam-widths. 

(iii) For comparable characteristics, guides with rectangular pro- 
files have lower attenuation than those with truncated-para- 
bolic profile. Therefore if the transmission of images is not 
important, such as in the case of the ribbon-like guide of Ref. 6 
and guides for integrated optics, rectangular index profiles are 
more attractive than parabolic profiles. 

(iv) The attenuation per 90° bend is smaller than 107° in a guide 
such that the distance between beam center and the external 
edge of the guide is larger than a couple of half beam-widths, 
that is, if 

$$\frac{a/2 - \rho}{2w} > 1.$$


APPENDIX 


Modes in Curved Guides 
With Truncated-Parabolic Index Profile 


We start studying the two-dimensional curved guide depicted in 
Fig. la in cylindrical coordinates. Later we will introduce a variation 
of the index profile along y. 

The parabolic refractive index distribution within the guide is 

$$n_i = n\left[1 - \Delta\left(1 + 2\,\frac{r - R}{a}\right)^2\right] \qquad (13)$$

where a is the width of the guide, n the refractive index in the center 
and $n(1 - \Delta)$ the refractive index at the edges. The refractive index 
outside the guide is 

$$n_o = n(1 - \Delta - \Delta_p). \qquad (14)$$


Assuming that the electromagnetic field does not vary along y and 
that the only component along that direction is H, , all the field com- 
ponents either inside or outside the guide are™ 








H,=H 
b, = w = 
UY exp [i(6 — w1)] (15) 
t dH 
ae rr 


o 


where $\omega$ is the angular frequency, $\epsilon$ the permittivity of free space, 
and the indices i and o refer to the inside and outside of the guide. 
The resulting wave equation for both media is 

$$\frac{d^2 H_{i,o}}{dr^2} + \frac{1}{r}\frac{dH_{i,o}}{dr} + \left(k^2 n_{i,o}^2 - \frac{\nu^2}{r^2}\right) H_{i,o} = 0 \qquad (16)$$

in which $k = 2\pi/\lambda$ and $\lambda$ is the free space wavelength. Within the 
guide $n_i$ is given by equation (13) and the wave equation can be re- 
duced to 


$$\frac{d^2 H_i}{d\xi^2} + \left[\eta + \frac{1}{2} - \frac{1}{4}(\xi + \xi_0)^2\right] H_i = 0 \qquad (17)$$

by making the following substitutions 
2(r — Re) 
ow aye ed 
(18) 
f == (1 — d), 
v=k.R, (19) 
_ hint — kes ze d) L 
in which 
laA an 
— j74 _ Lae 21 
. 7 ™M V/8A 21) 
a a _ 2p 
Oe GAR a! (22) 
a = = ea), (23) 
and 
r 
= ——. 24 
nv 8A a 


Furthermore, equation (17) has been derived making the following 
simplifying assumptions 

at, 

A, «1, (25) 


nN a 
aVA Si aR 





The physical significance of w, d, A and the inequalities are given in 
the text. 
The solution of equation (17) is™* 


$$H_i = D_\eta(\xi + \xi_0) = \exp\left[-\frac{(\xi + \xi_0)^2}{4}\right] \mathrm{He}_\eta(\xi + \xi_0) \qquad (26)$$

where $D_\eta(\xi + \xi_0)$ is the parabolic cylinder function of order $\eta$ and 
$\mathrm{He}_\eta(\xi + \xi_0)$ is the Hermite function of order $\eta$. Only if $a \to \infty$, $\eta$ 
becomes an integer, the Hermite function is reduced to a polynomial 
and $H_i$ becomes the well-known solution of the parabolic lens-like 
medium extending to infinity.’ 
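
For integer order the statement above can be checked numerically. The sketch below builds $D_n(x) = \exp(-x^2/4)\,\mathrm{He}_n(x)$ from the standard recurrence $\mathrm{He}_{k+1} = x\,\mathrm{He}_k - k\,\mathrm{He}_{k-1}$ and evaluates the residual of the equation $y'' + (n + \tfrac{1}{2} - x^2/4)\,y = 0$ by central differences. It covers only integer n, that is the limit $a \to \infty$; the function names are illustrative.

```python
import math


def hermite_he(n, x):
    """Hermite polynomial He_n(x) via He_{k+1} = x*He_k - k*He_{k-1}."""
    prev, cur = 1.0, x
    if n == 0:
        return prev
    for k in range(1, n):
        prev, cur = cur, x * cur - k * prev
    return cur


def parabolic_cylinder(n, x):
    """D_n(x) = exp(-x^2/4) * He_n(x) for non-negative integer n."""
    return math.exp(-x * x / 4.0) * hermite_he(n, x)


def weber_residual(n, x, step=1e-3):
    """Residual of y'' + (n + 1/2 - x^2/4) y = 0, second derivative by central differences."""
    y_plus = parabolic_cylinder(n, x + step)
    y_mid = parabolic_cylinder(n, x)
    y_minus = parabolic_cylinder(n, x - step)
    second_derivative = (y_plus - 2.0 * y_mid + y_minus) / step ** 2
    return second_derivative + (n + 0.5 - x * x / 4.0) * y_mid


if __name__ == "__main__":
    for n in range(4):
        print(n, weber_residual(n, 1.3))
```

The residuals are at the level of the finite-difference error, as expected for an exact solution.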

Outside of the guide, that is for r > R, the refractive index n, is 
uniform, equation (14), and the solution of the wave equation (16) is” 
the Hankel function of order $\nu$ and argument $kn_o r$. That is 

$$H_o = H_\nu^{(1)}(kn_o r). \qquad (27)$$


To match fields at the boundary r = R, the radial admittance 
H,/E, inside and outside the guide must be identical. With the help 
of equations (15), (26) and (27), we obtain the characteristic equation 


w nD,(&o) = H;” (kn R) 


2 Dik) ~ HO" (nh) -) 





in which the derivatives are taken with respect to the arguments of the 
functions. 

We should have another boundary equation for the other side of the 
guide, r = R − a, but we are interested in guides with radius of curva- 
ture R small enough to push the field away from the center of the 
guide, and consequently the field at the interface r = R − a is negligibly 
small. 

To solve explicitly the boundary or characteristic equation (28) for 
k, , we need asymptotic expansions of the functions involved. From the 
inequalities in equation (25), it can be deduced that 


| |>>1 and | &{ >| |. (29) 


The asymptotic expansion for D,(£) is then”* 


V2 


I'(—7) 


where $\Gamma(-\eta)$ is the gamma function of argument $(-\eta)$. 
The asymptotic expansion for the Hankel function results from 
observing that as a consequence of equation (25) 


kn,k > 1, 
kk > 1, (31) 


2 


D,(&) = & exp (-8 — inn) 5 





prep (@) 0) 


kino 


aco 
and 
(k? — k’n?)} « 1, 


Therefore we can replace the Hankel function by Watson’s approxi- 
mation.’* This approximation involves Bessel functions of order one- 
third and large arguments. Keeping the first term of their asymptotic 
expansions, the Hankel function results 


Bee) aR(k2 — kn? : kn?) {i exp |, (ke — ent) | 


On er ‘]} 

+ 9 exp | 3k. (k, k No) (32) 
Substituting equations (30) and (32) in equation (28) we obtain a 
simplified version of the characteristic equation 


2 —27-1 A ° 
1+ nee = fo" exp (E+ inn) 


be a ee exp (§@ a + inn) 


1 +7 exp [oe (ke — int) | 


= een ee, (33) 
To solve this equation for $k_z$ we rewrite it as 

$$\Gamma(-\eta) = F(\eta) \qquad (34)$$

and notice that $F(\eta)$ is a large quantity. Therefore the gamma func- 
tion is also large and hence $\eta$ must be near a pole, which makes $\eta$ close 
to an integer p. Then we can replace the gamma function by the first 
term of the Laurent series $(-1)^p/[p!\,(p - \eta)]$, and equation (34) becomes 

$$\eta = p - \frac{(-1)^p}{p!\,F(p)}. \qquad (35)$$
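
The pole approximation used in this step, $\Gamma(-\eta) \approx (-1)^p/[p!\,(p - \eta)]$ for $\eta$ close to the integer p, can be checked with the standard library. This is a small sketch; the offset of $10^{-6}$ and the function name are illustrative choices.

```python
import math


def gamma_pole_approx(eta, p):
    """First Laurent term of Gamma(-eta) about eta = p: (-1)^p / (p! * (p - eta))."""
    return (-1) ** p / (math.factorial(p) * (p - eta))


def relative_error(p, offset=1e-6):
    """Relative difference between math.gamma(-eta) and the pole term at eta = p - offset."""
    eta = p - offset
    exact = math.gamma(-eta)
    return abs(exact - gamma_pole_approx(eta, p)) / abs(exact)


if __name__ == "__main__":
    for p in range(4):
        print(p, relative_error(p))
```

The relative error is of the order of the offset itself, so the single pole term is adequate whenever $\eta$ is very close to an integer.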
Substituting $\eta$ by the value given in equation (20) we derive the ex- 
plicit value of $k_z$. This propagation constant is complex, $k_z = \beta + i\alpha$, 
and the real and imaginary parts are the phase and attenuation con- 
stants of the pth mode: 


B= Rek, 


(36) 
= bot — Gaon Le +3 +300 ~ 5) + Ra |} 
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[-3 (Gr) 
_OPL 3 Vm / J1+2M — Ml? 


ree dKRV/2A a+ My 87) 
where 
A, 4 73 
ap = ; i a7 @*5)aa | (38) 
(1 — a)’ 
___ exp E ad — | 
K = V2rp! (39) 


[Vad i — ad) 


In equation (37), M affects the value of $\alpha$ mostly via the exponential 
and not via the fraction 

$$\frac{1 + 2M - M^2}{(1 + M)^2}$$

which for all practical purposes can be replaced by 1. Consequently 
the normalized loss per radian $\sqrt{2\Delta}\,R\alpha$ results 


exp | 8 (44] 


L= V2A Ra = a 


(40) 
Now we turn to guides in which the refractive index is a function 
of y, Figs. 2a and 2b. 
Let us start with the ribbon-like structure of Fig. 2a and assume as 
in Ref. 6 that 


By 2 2: (41) 


Provided that most of the electromagnetic field travels within the 
ribbon, the attenuation per radian is still given by equation (40), but 
the phase constant is a slight modification of equation (36). From Ref. 12 
is deduced 


(1 4 20 — A,)’A, yt or field 
kn | x(q + 1) |? ab polarized along y, 
Pee ere on 
( 14 2 Ay" for field 
rb polarized along z, 


where q + 1 indicates the number of maxima of electric field within 
the guide along y and 


A (48) 


ee Se 
: nV 8A, 
Consider another guide, Fig. 2b, with rectangular cross-section and 
truncated parabolic index profile along both the x and y directions 


n= nf a(i a: gt — Ry = a(2)']. (44) 


Provided that most of the electromagnetic field is within the guide 
cross-section, the loss per radian is still given by equation (40), but 
the phase constant becomes* 
Z a)" | 
1 (1 + in 


2 | 


1 
Bs = B- Pees 2 SS ee Se ee 
oh =— 2q+1 3 
in  vea(2y +044) 


A 





(45) 


where gq + 1 is the number of maxima of the electric field along y and 


a ee 
W. = , /BA (46) 
If 


and 


the guide has square cross-section and equations (40) and (45) yield 
a first approximation of the phase and attenuation constants in a 
curved SELFOC? guide. 
Radiation Losses of the Dominant Mode 
in Round Dielectric Waveguides 


By DIETRICH MARCUSE 


(Manuscript received March 5, 1970) 


The radiation loss theory that has been developed in a series of earlier 
papers is extended to the dominant mode of the round dielectric waveguide. 
The theory is applied to the calculation of radiation losses of abrupt steps, 
gradual tapers, and random wall perturbations of the round dielectric 
waveguide. 

The radiation losses caused by an abrupt step, and consequently the losses 
of tapers, are far higher for the dominant mode of the round dielectric wave- 
guide than they are for corresponding steps and tapers of the dielectric slab 
waveguide. However, the losses caused by infinitesimal random wall pertur- 
bations of the round waveguide are nearly equal to the random wall losses pre- 
dicted on the basis of the slab waveguide theory. In fact the losses of the 
dominant mode as well as the circular electric $\mathrm{TE}_{01}$ mode of the round rod due 
to random wall perturbations are very nearly the same. 

The theory is limited to circular symmetric distortions of the round dielec- 
tric rod (diameter changes). The radiation losses caused by steps of the 
round dielectric waveguide that carries the dominant guided mode have been 
verified by experiments at millimeter wave frequencies. 


I. INTRODUCTION 


A series of earlier papers was devoted to radiation losses of TE and 
TM modes in dielectric slab waveguides.’~* The radiation losses were 
assumed to be caused either by random perturbations of the waveguide 
boundary’ or by steps and tapers of the slab waveguide.* Experiments 
to verify the radiation loss theory were conducted with millimeter 
waves in round teflon rods, and the theory was extended to cover this 
case.” 

These earlier publications were limited to the simplified case of 
electromagnetic fields that are independent of one coordinate. In the 
case of the slab waveguide we assumed 

$$\frac{\partial}{\partial y} = 0 \qquad (1)$$

while 

$$\frac{\partial}{\partial \phi} = 0 \qquad (2)$$

was required of the fields of the round dielectric waveguide. Restrictions 
(1) and (2) made it possible to separate the fields into transverse 
electric (TE) or transverse magnetic (TM) modes. 

The study of the simple slab waveguide yielded much useful informa- 
tion about the general properties of radiation losses and allowed us to 
infer the order of magnitude of the radiation losses caused by random 
wall imperfections. However, the dielectric slab is not a useful practical 
waveguide and can be used only as a simplified model to obtain informa- 
tion about the behavior of more realistic and more complicated struc- 
tures. Limitation (2) for the modes of the realistic and practical round 
dielectric waveguide precludes the application of the theory to the most 
important dominant mode of this structure. 

The present paper is devoted to a study of the radiation losses of the 
dominant mode of the round dielectric waveguide (optical fiber). To 
be able to handle the theory we still impose condition (2) on the deriva- 
tives related to the geometry of the waveguide but not on the field 
distribution. The resulting theory is still very complicated so that we 
must limit ourselves to sketching the theory and stating the final results. 

The radiation losses caused by random imperfections [obeying restric- 
tion (2)] are very nearly identical to the losses of the corresponding slab 
waveguide problem. However, the radiation losses of the dominant 
mode caused by steps and tapers in the waveguide are much higher 
than the corresponding losses of the TE or TM modes in the slab 
waveguide. The radiation losses of the dominant mode due to waveguide 
steps have been found experimentally to be in agreement with the 
theory. 

In order to allow the reader to obtain the information concerning the 
results of the theory unencumbered by complex mathematical formulas 
we start the paper with a discussion of the results. The remainder of 
the paper is devoted to an outline of the theory that was used to obtain 
these results. 

II. NUMERICAL AND EXPERIMENTAL RESULTS 


2.1 Radiation Losses of Waveguide Steps 


We begin the discussion of the consequences of the radiation loss 
theory of the dominant mode of the round dielectric waveguide by 
considering the radiation losses caused by the abrupt step of the wave- 
guide diameter shown in Fig. 1. As described in Section II, the radiation 
losses caused by an abrupt step can be calculated by two different 
methods. The mode matching technique infers the loss from the trans- 
mission coefficient of the guided mode that continues to travel in the 
waveguide after it has passed the step. The radiation loss method 
accounts for the lost power by directly calculating the amount of power 
radiated into space. Both methods involve approximations so that we 
cannot expect to obtain exactly the same results either way. 

Figure 2 shows the results of both methods of calculation. The radia- 
tion loss caused by a step with $a_2/a_1 = 0.5$ as a function of $ka_1$ (as 
computed by means of the mode matching technique) is shown as the 
dotted line in the figure, while the solid line represents the result of the 
radiation loss method. The curve holds for a dielectric rod with index 
of refraction n = 1.432 ($n^2 = 2.05$). This index was chosen since it is 
representative of teflon at a frequency of 55 GHz. The agreement of 
the two methods is remarkably good considering the approximations 
involved in deriving the theoretical expressions. 

Even better agreement is obtained by a similar calculation that applies 
to a dielectric rod with index of refraction n = 1.01 as shown in Fig. 3. 
Both figures are extended over $ka_1$ values that correspond to single 
guided mode operation. There are other guided modes possible over 
part of the range of $ka_1$ values but these other modes do not couple 
to the dominant mode of the round dielectric rod because of the restric- 
tion on symmetry imposed by equation (2). It is in this sense that the 
operation of the waveguide is single mode. No other guided mode occurs 
under the imposed conditions. The shapes of the two curves in Figs. 2 
and 3 are very similar. Both curves reach into high loss regions for small 
values of $ka_1$. The curve of Fig. 3 is applicable to a clad optical fiber 
with 1 percent index difference between core and cladding. The curves 
shown on the bottom of Figs. 2 and 3 represent the step losses of TE 
and TM modes of the slab waveguide.* These curves are computed for 
the same index of refraction. The dimension $a_1$ (of $ka_1$) is the half width 
of the slab in the case of the slab waveguide. It is striking how much 
lower the radiation losses of the guided modes of the slab waveguide are 
compared to the dominant mode of the round dielectric rod. 

Fig. 1—Step in the round dielectric waveguide. 

Fig. 2—Relative radiation loss caused by an abrupt step with $a_2/a_1 = 0.5$ of the 
waveguide. The two curves labeled dominant mode of the round waveguide were 
obtained by the mode matching technique (dotted line) and by the radiation loss 
technique (solid line). The two curves at the bottom of the figure labeled TE and 
TM modes represent the step losses of the slab waveguide. The radius $a_1$ (appearing 
in $ka_1$) belongs to the larger waveguide section. Index of refraction n = 1.432. 

Fig. 3—This curve is similar to Fig. 2 with n = 1.01 and $a_2/a_1 = 0.5$. 

Fig. 4—Experimental resonant cavity set up to measure radiation losses of wave- 
guide steps. 

Because of the complexity of the theory and because the step loss 
results are so different for the round rod and the slab waveguide, it 
appeared desirable to confirm the loss predictions of the theory with an 
experiment. The experiment was conducted with millimeter waves 
(approximately 55 GHz). A round teflon rod of 0.191 cm diameter was 
mounted between two metallic reflectors as shown in Fig. 4. The resulting 
resonant cavity could be excited through small holes in the reflector 
plates that, simultaneously, acted as supports for the teflon rod. Two 
teflon sleeves of 0.216 cm and 0.242 cm outer diameter could be slid over 
the teflon rods to produce a round dielectric waveguide with two steps. 
The losses caused by the steps could be determined from Q measure- 
ments of the cavity with and without the teflon sleeves. The results 
of these loss measurements (applied to one step) are shown as crosses 
in Fig. 5. This figure also shows the theoretical loss predictions of the 
mode matching (dotted line) and the radiation loss approach (solid line) 
of the theory. Note that the parameter value $ka_2 = 1.1$ of this figure 
uses the fixed value of the narrower portion of the waveguide as reference. 
The point $a_2/a_1 = 0.5$ of Fig. 5 corresponds to the point $ka_1 = 2.2$ of 
Fig. 2. The measurements support the result of the round rod theory. 
The radiation losses of the slab waveguide even for much larger steps 
are still far lower than the measured values of these smaller steps of 
the round rod. 
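
The quoted parameter value of about 1.1 for the narrow section can be recovered from the rod diameter and the operating frequency. The sketch below assumes the free-space relation $k = 2\pi f/c$ and takes the sleeve outer diameters as the wide-section diameters; the function names are illustrative.

```python
import math

SPEED_OF_LIGHT_CM_PER_S = 2.99792458e10


def ka_product(diameter_cm, frequency_hz):
    """Free-space wavenumber times rod radius, k*a with k = 2*pi*f/c."""
    k = 2.0 * math.pi * frequency_hz / SPEED_OF_LIGHT_CM_PER_S
    return k * diameter_cm / 2.0


def step_ratio(narrow_diameter_cm, wide_diameter_cm):
    """Radius ratio a2/a1 of a step from the wide to the narrow section."""
    return narrow_diameter_cm / wide_diameter_cm


if __name__ == "__main__":
    print(f"ka for the bare rod: {ka_product(0.191, 55e9):.3f}")
    for sleeve in (0.216, 0.242):
        print(f"sleeve {sleeve} cm: a2/a1 = {step_ratio(0.191, sleeve):.3f}")
```

With these dimensions the two sleeves give radius ratios of roughly 0.88 and 0.79, that is steps considerably smaller than the 2:1 step of Fig. 2.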

It is not as easy to confirm the loss predictions of the slab theory 
since a dielectric slab waveguide is somewhat of an idealization. In 
particular it is hard to excite a slab with a mode that has no field varia- 
tion in the y-direction. In order to obtain some approximation to the 
slab waveguide we constructed a dielectric (teflon) ribbon whose 
dimensions on the narrower portion were 0.380 by 0.095 cm and whose 
wider dimensions were 0.380 by 0.190 cm. Note that only the narrow 
side is affected by the step. The losses of this ribbon waveguide with 
a 2:1 step were measured in the same resonant setup and compared to 
the losses of a smooth ribbon with dimensions 0.380 by 0.095 cm. The 
radiation loss of the ribbon guide was $\Delta P/P = 0.08$ for $kd_2 = 1.1$ 
(or $kd_1 = 2.2$). This radiation loss value is shown as the circle in Fig. 5. 
It is apparent that the loss of the ribbon guide is far smaller than 
the loss of the round waveguide. It is about four times higher than 
the step loss predicted for the slab waveguide. However, we must keep 
in mind that the ribbon is only a poor approximation of the slab wave- 
guide. It is therefore not surprising that its radiation loss cannot be 
predicted by the slab waveguide theory. The slab waveguide apparently 
can tolerate steps in its width exceptionally well. 


2.2 Radiation Loss of Tapers 


The radiation loss theory that is presented in the theoretical part 
can be used to determine the loss of round dielectric waveguides with 
arbitrary diameter changes. Since the radiation losses of an abrupt step 
are very high for round dielectric waveguides it is interesting to study 
the radiation losses of gradual tapers.” 

Fig. 5—Comparison of theory and experiment. The crosses are measured step 
losses of the round dielectric waveguide. The circle is the step loss of a ribbon guide. 
The curves represent the results of the mode matching theory (dotted line) and 
the radiation loss theory (solid line). (n = 1.432, $ka_2 = 1.1$.) Note that the curve 
parameter $ka_2$ uses the radius of the smaller waveguide section. 

The calculation of the radiation losses of tapers can be simplified by 
observing that the dependence of $\beta_0$ on the radius of the waveguide is 
nearly linear over a considerable range of values. Figure 6 shows the 
ratio $\beta_0/k$ as a function of ka for n = 1.432. It is apparent that a 
straight line approximation is possible in the region 1.2 < ka < 2.5. 

We study the radiation losses of two different tapers. The linear taper 
is the simplest and therefore the most reasonable taper to investigate. 
However, there are reasons to suspect that the linear taper may have 
higher radiation losses than other forms of tapers. It is apparent from 
equation (36) of Section II that the result of the integration (aside from 
the complicated factor $I(\rho, z)$ which is difficult to evaluate) depends on 
the product of the derivative of the radius function a(z) with sine and 
cosine functions of the form $\cos \int_0^z [\beta_0(z) - \beta]\,dz$. ($\beta_0$ is the propagation 
constant of the guided mode; $\beta$ is the z-component of the propagation 
constant of the radiation modes.) The oscillatory function has the 
tendency to cancel contributions from those functions that appear 
multiplied with it under the integrand. The more rapidly the cosine 
function oscillates, the more effective will be its canceling influence. 
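
The canceling influence of the oscillatory factor can be illustrated with a deliberately simplified model. The sketch below assumes a linear taper (constant da/dz), a constant difference between the two propagation constants, and it ignores the factor that the text calls difficult to evaluate. Under these assumptions the normalized integral reduces to $\sin x / x$ with x equal to the propagation-constant difference times the taper length. The function names are illustrative.

```python
import math


def linear_taper_overlap(x, samples=20000):
    """Midpoint-rule value of the integral of cos(x*u) for u from 0 to 1.

    x stands for (propagation constant difference) * (taper length);
    u = z/L is the normalized position along the taper.
    """
    total = 0.0
    for i in range(samples):
        u = (i + 0.5) / samples
        total += math.cos(x * u)
    return total / samples


def closed_form(x):
    """Exact value sin(x)/x of the same integral."""
    return math.sin(x) / x


if __name__ == "__main__":
    for x in (1.0, 10.0, 100.0):
        print(x, linear_taper_overlap(x), closed_form(x))
```

The envelope of the result falls off as 1/x, so a longer taper or a larger mismatch of the propagation constants leaves less net contribution, which is the qualitative argument made in the text.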





Fig. 6—Plot of the propagation constant $\beta_0$ of the dominant mode of the round 
dielectric waveguide. (n = 1.432.) 




This consideration shows that we would like to see the values of 
$\beta_0(z) - \beta$ as large as possible. The smallest possible value, and conse- 
quently the most harmful, is the value $\beta_0(z) - k$ that is assumed at 
the upper end of the integration range in equation (34). However, 
because of the z dependence of $\beta_0$ the values of $\beta_0(z) - k$ are smaller 
at the narrow portion of the taper than they are on its wider portion. 
One might expect, therefore, that the narrow region of the linear taper 
contributes more to the overall radiation loss than its wider portions. 
It appears that the taper could be optimized if larger values of da/dz 
appeared at the wider end of the taper where the canceling effect of the 
sinusoidal functions is still more effective. Following this idea, it is 
possible to show that an exponential taper should distribute the radia- 
tion loss more evenly over its entire length in comparison with the 
linear taper. A linear taper and an exponential taper are shown in 
Fig. 7. The exponential taper was calculated from 

$$a(z) = a_2 + (a_1 - a_2) \exp\left(-4.6\,\frac{z}{L}\right).$$

This taper is designed to equalize the contribution of the integral (36), 
at least approximately, over the entire length of the taper assuming 
that $I(\rho, z)$ is constant. The discontinuity of da/dz at z = 0 does not 
contribute to the radiation loss. It would, therefore, be of no advantage 
to shape the taper such that da/dz is continuous over its entire length. 
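
The two taper profiles can be compared with a few lines of code. The sketch below assumes the exponential profile $a(z) = a_2 + (a_1 - a_2)\exp(-4.6\,z/L)$ with $a_1$ the radius at the wide end (z = 0) and $a_2$ the radius of the narrow section; the subscript assignment and the function names are assumptions made for illustration.

```python
import math


def exponential_taper(z, L, a1, a2):
    """Radius of the exponential taper at position z (0 <= z <= L)."""
    return a2 + (a1 - a2) * math.exp(-4.6 * z / L)


def linear_taper(z, L, a1, a2):
    """Radius of the linear taper at position z (0 <= z <= L)."""
    return a1 + (a2 - a1) * z / L


def residual_step_fraction():
    """Fraction of the radius difference a1 - a2 left at z = L in the exponential taper."""
    return math.exp(-4.6)


if __name__ == "__main__":
    L, a1, a2 = 400.0, 1.0, 0.5
    for z in (0.0, 100.0, 200.0, 400.0):
        print(z, linear_taper(z, L, a1, a2), exponential_taper(z, L, a1, a2))
    print("residual step fraction:", residual_step_fraction())
```

The constant 4.6 leaves about one percent of the radius difference at the end of the taper, and the steepest slope sits at the wide end, in line with the argument above.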

The radiation losses of the linear and exponential tapers are compared 
in Fig. 8. Even though the radiation loss of the exponential taper is less 
than that of the linear taper, in agreement with our expectation, the 
amount of improvement is insufficient to warrant the greater com- 
plexity required to produce such a more complicated taper. Figure 8 
also shows that the radiation loss of a taper is far less than the losses 
caused by an abrupt step. The radiation losses can be made as small as 
desired with a taper of sufficient length. A linear taper with a length to 
waveguide radius (on the larger portion of the guide) ratio of $L/a_1 = 400$ 
reduces the radiation losses that would occur on an abrupt step by a 
factor of 100. With $\lambda = 1$ μm the value $ka_1 = 2.5$ is realized for $a_1 = 
0.4$ μm so that the taper would have an actual length of L = 160 μm 
or 0.16 mm. It is apparent that much longer, more effective tapers 
are feasible. 
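
The numbers in this estimate follow from two multiplications, retraced here as a small sketch. The function names are illustrative; the inputs are the values quoted in the text.

```python
import math


def ka_from_radius(radius_um, wavelength_um):
    """Product k*a with k = 2*pi/wavelength."""
    return 2.0 * math.pi / wavelength_um * radius_um


def taper_length(length_to_radius_ratio, radius_um):
    """Physical taper length L from the ratio L/a and the radius a."""
    return length_to_radius_ratio * radius_um


if __name__ == "__main__":
    print(f"ka = {ka_from_radius(0.4, 1.0):.2f}")
    print(f"L = {taper_length(400.0, 0.4):.0f} um")
```

A radius of 0.4 μm at a wavelength of 1 μm gives a ka of about 2.5, and the ratio of 400 then corresponds to 160 μm.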

Figure 8 indicates that there are two distinctly different regions. 
Below L/a, = 2 the taper is so short that it acts like an abrupt step. 
The beneficial effect of the taper makes itself felt only if the taper is 
long enough. The reduction of the radiation loss of a gradual taper
compared to an abrupt step or steep taper is caused by the canceling
influence of the (complex) exponential function in the integral of
equation (36).

Fig. 7—The profile of the linear (dotted line) and the exponential (solid line) taper.
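The canceling effect of the oscillating exponential can be illustrated with a small numerical sketch. It assumes a constant coupling factor (set to one), a linear taper, and an arbitrary illustrative value for the difference of the propagation constants; none of these numbers are taken from the paper, and only the magnitude of the result is meaningful.

```python
import cmath
import math

def taper_amplitude(slope, length, delta_beta, steps=20000):
    """Midpoint-rule estimate of the integral of (da/dz) exp(-i delta_beta z) over 0..length."""
    dz = length / steps
    total = 0j
    for j in range(steps):
        z = (j + 0.5) * dz
        total += slope(z) * cmath.exp(-1j * delta_beta * z) * dz
    return total

def linear_slope(length, a1, a2):
    """Constant da/dz of a linear taper from a1 to a2."""
    return lambda z: (a2 - a1) / length

def linear_closed_form(length, a1, a2, delta_beta):
    """Magnitude of the same integral for a linear taper, evaluated analytically."""
    half_phase = 0.5 * delta_beta * length
    return abs(a2 - a1) * abs(math.sin(half_phase) / half_phase)

delta_beta = 1.0  # hypothetical value, in inverse length units
short = abs(taper_amplitude(linear_slope(0.1, 1.0, 0.5), 0.1, delta_beta))
long_ = abs(taper_amplitude(linear_slope(50.0, 1.0, 0.5), 50.0, delta_beta))
```

For the short taper the magnitude is essentially the full step height, so the taper acts like an abrupt step. For the long taper the phase factor averages most of the contribution away.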


2.3 Losses Caused by Random Wall Imperfections 


An important loss contribution is caused by the random deviations
of the dielectric waveguide boundary from perfect straightness. These
radiation losses have been investigated for the slab waveguide (Ref. 1) and for
the circular electric TE01 mode (Ref. 2). The theory of radiation losses of the
dominant mode of the round dielectric waveguide is sketched in
Section III.

We have seen that the radiation losses caused by arbitrary deforma- 
tions of the waveguide wall can be computed by describing the wall 
deviation as a series of infinitesimal steps. We have also seen that the
single loss for large steps is far higher for the round dielectric waveguide 
than it is for the slab waveguide. We might thus worry that the losses 
caused by random wall perturbations may also be far higher for the 
dominant mode of the round dielectric waveguide. Fortunately, this 
pessimistic expectation is not true. The radiation losses caused by wall 
roughness of the round dielectric rod are no worse than they are for the 
modes of the slab waveguide. 

The random wall losses are treated on the basis of a statistical model. 
The correlation function describing the wall perturbation is assumed 
to be a simple exponential function that is characterized by two parameters,
the rms deviation from perfect straightness $A$ and the correlation
length $B$.

Figure 9 shows a series of curves of the normalized relative radiation 
loss as a function of the ratio of correlation length to waveguide radius 
B/a for a guide with index of refraction n = 1.432 (teflon). The curve 
parameter is the product of vacuum propagation constant times wave- 
guide radius, ka. Also shown for means of comparison is the loss of the 
circular electric mode of the round waveguide as a dotted line. It is 
apparent that the radiation losses of the dominant mode are approxi- 
mately equal to the radiation loss of the circular electric mode. A 
comparison with the results of Ref. 1 shows that the losses of Fig. 9 are 
approximately four times as high as the corresponding losses for the 
slab waveguide. For a meaningful comparison we must remember, 
however, that the slab waveguide losses were computed under the 
assumption that only one of the two slab boundaries was randomly 
perturbed. It seems reasonable to compare the losses of the round rod 
to a slab waveguide whose two walls are perturbed in a correlated way. 
In fact, if we assume that the thickness of the slab waveguide changes 
in a manner that provides equal but opposite displacement of each side 
of the guide we would obtain a four times higher loss than is shown in 
the curves of Ref. 1. The agreement between the radiation losses of the
slab waveguide and the random wall losses of the round dielectric
waveguide is quite close.

Fig. 8—Relative radiation loss of the linear (dashed line) and the exponential
(solid line) taper. ($n = 1.432$, $a_2/a_1 = 0.5$, $ka_1 = 2.5$.)

Fig. 9—Normalized radiation losses caused by random wall perturbations. The
solid lines correspond to the dominant mode of the round guide, the dashed line
represents the TE01 mode of this waveguide. ($n = 1.432$.) The curve parameters
are the values of $ka$.

Figure 10 shows similar loss curves for a round waveguide with index 
of refraction n = 1.01. These curves too are about four times as high 
as the corresponding slab waveguide losses for the reason explained 
above. The curves of Fig. 10 are representative of the wall losses of 
a clad optical fiber with 1 percent index difference. As an example let
us assume that we operate an optical fiber with a vacuum wavelength of
$\lambda = 1\ \mu$m. The value $ka = 15$ corresponds to a radius $a = 2.4\ \mu$m for
the inner core of the fiber. If we assume that the correlation length of
the exponential correlation function assumes its worst possible value
$B/a = 2.0$, we find from Fig. 10 the normalized loss

$$\frac{a^3}{A^2}\,\frac{\Delta P}{L P} = 0.04.$$

A loss factor of

$$\alpha = \frac{\Delta P}{L P} = 2.3 \cdot 10^{-5}\ \text{cm}^{-1} = 10\ \text{dB/km}$$

would be caused by an rms deviation of the waveguide radius of
$A = 9 \cdot 10^{-8}$ cm = 9 Å. This example shows how very stringent the
tolerance requirements can be. In a realistic case there will not only be
variations of the waveguide radius. In addition we do not know the 
statistical model of the correlation function that must be applied in 
each case. However, comparison of different correlation function models 
has shown that the peak and its location in Figs. 9 and 10 are not
dependent on the assumed statistical model. The decay of the loss curves
toward increasing values of B/a is strongly model dependent. 
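The tolerance figure in the example above can be reproduced with the following sketch. It assumes that the loss factor is a power attenuation coefficient, so that 10 dB/km corresponds to ln(10) per kilometer; the function name is hypothetical.

```python
import math

def rms_deviation_cm(normalized_loss, radius_cm, loss_db_per_km):
    """Solve (a^3 / A^2) * alpha = normalized_loss for the rms deviation A (in cm)."""
    alpha_per_km = loss_db_per_km * math.log(10.0) / 10.0  # power loss per km
    alpha_per_cm = alpha_per_km / 1.0e5                    # 1 km = 1e5 cm
    return math.sqrt(alpha_per_cm * radius_cm ** 3 / normalized_loss)

# Values from the text: normalized loss 0.04, a = 2.4 micrometers, 10 dB/km.
deviation_cm = rms_deviation_cm(0.04, 2.4e-4, 10.0)
deviation_angstrom = deviation_cm * 1.0e8
```

The result is close to 9 Å, in agreement with the value quoted in the text.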


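Fig. 10—These curves are similar to Fig. 9 with n = 1.01. The curve parameters
are the values of ka.


III. THEORY


3.1 The Dominant Guided Mode


The field components of an arbitrary guided mode in the waveguide
are described by the following equations:

$$E_z = A J_\nu(\kappa r)\cos\nu\phi \tag{3a}$$

$$H_z = B J_\nu(\kappa r)\sin\nu\phi \tag{3b}$$

$$E_r = -\frac{i}{\kappa^2}\left[\beta_0\kappa A J_\nu'(\kappa r) + \omega\mu_0\frac{\nu}{r} B J_\nu(\kappa r)\right]\cos\nu\phi \tag{3c}$$

$$E_\phi = \frac{i}{\kappa^2}\left[\beta_0\frac{\nu}{r} A J_\nu(\kappa r) + \omega\mu_0\kappa B J_\nu'(\kappa r)\right]\sin\nu\phi \tag{3d}$$

$$H_r = -\frac{i}{\kappa^2}\left[n^2\omega\epsilon_0\frac{\nu}{r} A J_\nu(\kappa r) + \beta_0\kappa B J_\nu'(\kappa r)\right]\sin\nu\phi \tag{3e}$$

$$H_\phi = -\frac{i}{\kappa^2}\left[n^2\omega\epsilon_0\kappa A J_\nu'(\kappa r) + \beta_0\frac{\nu}{r} B J_\nu(\kappa r)\right]\cos\nu\phi. \tag{3f}$$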


These equations describe the field inside of the round dielectric rod,
$r < a$. The functions $J_\nu$ are the Bessel functions of order $\nu$; a prime
indicates the derivative with respect to the argument (not with respect
to $r$). The parameter $\nu$ must be an integer in order to make sine and
cosine periodic functions of the azimuth $\phi$ with period $2\pi$. The factor

$$e^{i(\omega t - \beta_0 z)} \tag{4}$$

was omitted from equations (3). The propagation constant $\beta_0$ is related
to the constant $\kappa$ and the free space propagation constant $k$ by the
relations

$$k^2 = \omega^2\epsilon_0\mu_0 \tag{5}$$

and

$$\kappa^2 = n^2k^2 - \beta_0^2, \tag{6}$$

where $n$ is the index of refraction of the dielectric material. The constants
$A$ and $B$ are not independent of each other. Their mutual dependence
is given by the boundary conditions for the field components.
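The fields on the outside of the dielectric rod, $r \ge a$, are given by the
equations

$$E_z = C H_\nu^{(1)}(i\gamma r)\cos\nu\phi \tag{7a}$$

$$H_z = D H_\nu^{(1)}(i\gamma r)\sin\nu\phi \tag{7b}$$

$$E_r = \frac{i}{\gamma^2}\left[i\gamma\beta_0 C H_\nu^{(1)\prime}(i\gamma r) + \omega\mu_0\frac{\nu}{r} D H_\nu^{(1)}(i\gamma r)\right]\cos\nu\phi \tag{7c}$$

$$E_\phi = -\frac{i}{\gamma^2}\left[\beta_0\frac{\nu}{r} C H_\nu^{(1)}(i\gamma r) + i\gamma\omega\mu_0 D H_\nu^{(1)\prime}(i\gamma r)\right]\sin\nu\phi \tag{7d}$$

$$H_r = \frac{i}{\gamma^2}\left[\omega\epsilon_0\frac{\nu}{r} C H_\nu^{(1)}(i\gamma r) + i\gamma\beta_0 D H_\nu^{(1)\prime}(i\gamma r)\right]\sin\nu\phi \tag{7e}$$

$$H_\phi = \frac{i}{\gamma^2}\left[i\gamma\omega\epsilon_0 C H_\nu^{(1)\prime}(i\gamma r) + \beta_0\frac{\nu}{r} D H_\nu^{(1)}(i\gamma r)\right]\cos\nu\phi \tag{7f}$$

where $H_\nu^{(1)}$ is the Hankel function of order $\nu$ and of the first kind. The
prime indicates again its derivative with respect to its argument. The
argument is imaginary in order to ensure that the field distribution
decays exponentially at large distance from the rod. The time and
z-dependent factor (4) has again been suppressed. The parameter $\gamma$ is
related to the propagation constant $\beta_0$ by the equation

$$\gamma^2 = \beta_0^2 - k^2.$$

The field components were written down quite generally for an arbitrary
guided mode. The lowest order or dominant mode of the guide follows
from these equations with

$$\nu = 1. \tag{8}$$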


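The following discussion will be limited to the special case $\nu = 1$. The
connection between the amplitude coefficients and the determination of
the propagation constant follows from the boundary conditions for the
field components. The requirement that $E_z$, $H_z$, $E_\phi$ and $H_\phi$ are continuous
at the boundary $r = a$ leads to the following eigenvalue equation
for the determination of the propagation constant $\beta_0$ of the guided mode:

$$\left\{ n^2\frac{a\gamma^2}{\kappa}\left[\frac{J_0(\kappa a)}{J_1(\kappa a)} - \frac{1}{\kappa a}\right] + \left[\frac{i\gamma a H_0^{(1)}(i\gamma a)}{H_1^{(1)}(i\gamma a)} - 1\right]\right\}
\left\{\frac{a\gamma^2}{\kappa}\left[\frac{J_0(\kappa a)}{J_1(\kappa a)} - \frac{1}{\kappa a}\right] + \left[\frac{i\gamma a H_0^{(1)}(i\gamma a)}{H_1^{(1)}(i\gamma a)} - 1\right]\right\}
= \left[\frac{(n^2 - 1)\beta_0 k}{\kappa^2}\right]^2. \tag{9}$$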


A few numerical values obtained from (9) are shown in Table I.

TABLE I—SOME NUMERICAL VALUES OF $\beta_0 a$

    n = 1.432 (n^2 = 2.05)        n = 1.01

    ka       beta_0 a             ka      beta_0 a
    0.5      0.50000013           2.0     2.0000001
    0.625    0.62500485           4.0     4.0000011
    0.75     0.75006586           5.0     5.0000672
    0.875    0.8758141            6.0     6.0006747
    1.0      1.0043348            7.0     7.0026448
    1.125    1.13887424           8.0     8.0064648
    1.25     1.2816903            9.0     9.0121047
    1.375    1.434524             10.0    10.019281
    1.5      1.5970437            12.0    12.03695
    1.75     1.9458015            14.0    14.057344
    2.0      2.3149367            16.0    16.07916
    2.25     2.6937751            18.0    18.101671
    2.5      3.0761411            20.0    20.124481
    2.75     3.458978             23.0    23.158808
    3.0      3.8409082            24.0    24.170225
                                  27.0    27.204311

The connection between the amplitude coefficients as a consequence of the
boundary conditions is stated in the following equations:


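$$B = -\left(\frac{\epsilon_0}{\mu_0}\right)^{1/2}\frac{(\kappa a)^2(\gamma a)^2}{(\beta_0 a)(ka)(n^2 - 1)}
\left\{\frac{n^2}{\kappa a}\left[\frac{J_0(\kappa a)}{J_1(\kappa a)} - \frac{1}{\kappa a}\right] + \frac{1}{\gamma a}\left[\frac{iH_0^{(1)}(i\gamma a)}{H_1^{(1)}(i\gamma a)} - \frac{1}{\gamma a}\right]\right\}A \tag{10}$$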











$$C = \frac{J_1(\kappa a)}{H_1^{(1)}(i\gamma a)}\,A \tag{11}$$

$$D = \frac{J_1(\kappa a)}{H_1^{(1)}(i\gamma a)}\,B. \tag{12}$$


It is necessary to know the relation between the amplitude coefficients 
and the power P carried by the mode: 


ps | tte {(ax)*[J2(ka) + J3(xa)] — 20%(«a)} (n’ +h z) 





4 
kB f,o{ HS?’ Gra) | \ : ( Ho a 
+ 7 {(on) Fear +1] + 2?pJi(xa)(1 + . 2 
4 2(H) S (22 SOME. Bo Jat) | ae (13) 
€o A K Y Lo 


Equations (3) through (13) provide a complete description of the guided
modes of symmetry $\cos\phi$. The lowest order solution of the eigenvalue
equation (9) is the dominant mode of the round dielectric rod. This
mode does not experience a cutoff. In principle it can be supported by
any round dielectric rod of arbitrarily small cross section and arbitrarily
low frequency. All other modes of the round dielectric waveguide exist
only above their respective cutoff frequencies. All entries in Table I
belong to single mode (with $\cos\phi$ symmetry) operation.
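As a rough numerical check of the eigenvalue equation (9), the following sketch solves it by bisection for one entry of Table I. It is not the author's program. It assumes the standard identity that replaces the Hankel-function ratio with modified Bessel functions, i x H0(ix)/H1(ix) = -x K0(x)/K1(x), works in units where a = 1, and evaluates the Bessel functions from their integral representations so that only the standard library is needed. All function names and the search bracket are illustrative choices.

```python
import math

def bessel_j(order, x, steps=200):
    """J_n(x) from Bessel's integral (1/pi) * int_0^pi cos(n t - x sin t) dt."""
    h = math.pi / steps
    total = 0.5 * (1.0 + math.cos(order * math.pi))  # endpoint terms
    for j in range(1, steps):
        t = j * h
        total += math.cos(order * t - x * math.sin(t))
    return total * h / math.pi

def bessel_k(order, x, h=0.02):
    """K_n(x) = int_0^inf exp(-x cosh t) cosh(n t) dt, truncated once negligible."""
    total = 0.5 * math.exp(-x)  # t = 0 endpoint term
    t = h
    while x * math.cosh(t) < 60.0:
        total += math.exp(-x * math.cosh(t)) * math.cosh(order * t)
        t += h
    return total * h

def residual(beta_a, ka, n_squared):
    """Left side minus right side of the eigenvalue equation, with a = 1."""
    kappa_a = math.sqrt(n_squared * ka ** 2 - beta_a ** 2)
    gamma_a = math.sqrt(beta_a ** 2 - ka ** 2)
    inner = gamma_a ** 2 / kappa_a * (
        bessel_j(0, kappa_a) / bessel_j(1, kappa_a) - 1.0 / kappa_a)
    outer = -(gamma_a * bessel_k(0, gamma_a) / bessel_k(1, gamma_a) + 1.0)
    lhs = (n_squared * inner + outer) * (inner + outer)
    rhs = ((n_squared - 1.0) * beta_a * ka / kappa_a ** 2) ** 2
    return lhs - rhs

def solve_beta_a(ka, n_squared, lo, hi, iterations=60):
    """Bisection on a bracket [lo, hi] that must contain a sign change."""
    f_lo = residual(lo, ka, n_squared)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = residual(mid, ka, n_squared)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

# Table I entry: n^2 = 2.05, ka = 2.0; the bracket is a hand-picked assumption.
beta_a = solve_beta_a(2.0, 2.05, 2.1, 2.7)
```

The bracket must lie strictly between ka and n ka, because the residual tends to zero at the lower limit and both sides diverge at the upper limit.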


3.2 Radiation Modes of the Round Dielectric Rod 


The number of guided modes that the round dielectric rod can 
support is finite at any given frequency. In order to obtain a complete 
set of normal modes of the structure we need to consider also the 
continuous spectrum of unguided modes. 

Any solution of Maxwell’s equations that satisfies the boundary 
condition is called a mode if its z-dependence (and time dependence) is 
given by equation (4). The guided modes are distinguished from the
unguided or radiation modes by the fact that their field distributions
decay exponentially for increasing values of r outside of the waveguide. 
The radiation modes, on the other hand, extend to infinity. As their 
name indicates they are necessary to describe the radiation field outside 
(and inside) of the dielectric waveguide. Since there is no need to limit 
the functions describing the radiation modes to those that decay 
exponentially in the limit of large values of r we use a combination of 
Bessel and Neumann functions to express the unguided modes. How- 
ever, we must require that the field remains finite on axis at r = 0. 
These considerations allow us to express the unguided solutions of 
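Maxwell’s equations as follows: For $r \le a$

$$E_z = F J_\nu(\sigma r)\cos\nu\phi \tag{14a}$$

$$H_z = G J_\nu(\sigma r)\sin\nu\phi \tag{14b}$$

$$E_r = -\frac{i}{\sigma^2}\left[\beta\sigma F J_\nu'(\sigma r) + \omega\mu_0\frac{\nu}{r} G J_\nu(\sigma r)\right]\cos\nu\phi \tag{14c}$$

$$E_\phi = \frac{i}{\sigma^2}\left[\beta\frac{\nu}{r} F J_\nu(\sigma r) + \omega\mu_0\sigma G J_\nu'(\sigma r)\right]\sin\nu\phi \tag{14d}$$

$$H_r = -\frac{i}{\sigma^2}\left[n^2\omega\epsilon_0\frac{\nu}{r} F J_\nu(\sigma r) + \beta\sigma G J_\nu'(\sigma r)\right]\sin\nu\phi \tag{14e}$$

$$H_\phi = -\frac{i}{\sigma^2}\left[n^2\omega\epsilon_0\sigma F J_\nu'(\sigma r) + \beta\frac{\nu}{r} G J_\nu(\sigma r)\right]\cos\nu\phi. \tag{14f}$$

There is now no restriction to the possible values that the propagation
constant $\beta$ can assume. The relation between $\beta$ and $\sigma$ is given by

$$\sigma^2 = n^2k^2 - \beta^2. \tag{15}$$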
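The field outside of the dielectric rod, $r \ge a$, is given by

$$E_z = [H J_\nu(\rho r) + I N_\nu(\rho r)]\cos\nu\phi \tag{16a}$$

$$H_z = [K J_\nu(\rho r) + M N_\nu(\rho r)]\sin\nu\phi \tag{16b}$$

$$E_r = -\frac{i}{\rho^2}\left\{\rho\beta\,[H J_\nu'(\rho r) + I N_\nu'(\rho r)] + \omega\mu_0\frac{\nu}{r}\,[K J_\nu(\rho r) + M N_\nu(\rho r)]\right\}\cos\nu\phi \tag{16c}$$

$$E_\phi = \frac{i}{\rho^2}\left\{\beta\frac{\nu}{r}\,[H J_\nu(\rho r) + I N_\nu(\rho r)] + \rho\omega\mu_0\,[K J_\nu'(\rho r) + M N_\nu'(\rho r)]\right\}\sin\nu\phi \tag{16d}$$

$$H_r = -\frac{i}{\rho^2}\left\{\omega\epsilon_0\frac{\nu}{r}\,[H J_\nu(\rho r) + I N_\nu(\rho r)] + \rho\beta\,[K J_\nu'(\rho r) + M N_\nu'(\rho r)]\right\}\sin\nu\phi \tag{16e}$$

$$H_\phi = -\frac{i}{\rho^2}\left\{\rho\omega\epsilon_0\,[H J_\nu'(\rho r) + I N_\nu'(\rho r)] + \beta\frac{\nu}{r}\,[K J_\nu(\rho r) + M N_\nu(\rho r)]\right\}\cos\nu\phi \tag{16f}$$

with

$$\rho^2 = k^2 - \beta^2. \tag{17}$$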


The Neumann functions $N_\nu$ are here expressed in the notation of
Jahnke-Emde.* The determination of the coefficients of the radiation 
modes is complicated by an interesting phenomenon. The boundary 
conditions provide us with four equations. However, there are six 
undetermined coefficients in the set of equations (14) and (16). Even 
allowing for the fact that the power of the mode can be chosen arbitrarily 
so that one coefficient must remain undetermined by the boundary 
conditions, we have still one more coefficient than the boundary condi- 
tions, combined with the requirement of total power carried by the mode, 
are able to determine. This situation means physically that the sets 
of equations (14) and (16) represent a superposition of two modes that 
could be taken apart. A similar situation would have arisen in the case 
of the slab waveguide had we not been careful to separate the modes 
into even and odd field distributions from the very beginning. The 
present structure does not lend itself to a natural separation of the 
modes into even and odd ones. However, the formal field expressions 
(14) and (16) do, nevertheless, represent a superposition of two possible 
sets of modes. One might try to take arbitrarily either the coefficient 
F or G appearing in equation (14) equal to zero to try to separate out 
the two sets of modes. This procedure is mathematically beyond re- 
proach but it suffers from a practical inconvenience. The resulting sets 
of modes would not be orthogonal. It is very desirable to choose the 
modes in such a way that they are all mutually orthogonal to each other. 
It is therefore necessary to determine the coefficients in a way that 
assures the orthogonality of all the modes. The boundary conditions 
combined with the requirement of mode orthogonality and a certain 
amount of power carried by each mode are still not enough to assure 
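a unique solution of our problem. This is not surprising since it is always
possible to combine two arbitrary vectors in an infinite number of ways
into two mutually orthogonal vectors.

The boundary conditions alone yield the following relations between
the coefficients

$$H = \frac{\pi}{2}(\rho a)\left\{\left[J_\nu(\sigma a)N_\nu'(\rho a) - n^2\frac{\rho}{\sigma}J_\nu'(\sigma a)N_\nu(\rho a)\right]F + \frac{(n^2 - 1)k^2}{\rho\sigma^2}\,\frac{\nu\beta}{\omega\epsilon_0 a}\,J_\nu(\sigma a)N_\nu(\rho a)\,G\right\} \tag{18}$$

$$I = \frac{\pi}{2}(\rho a)\left\{\left[n^2\frac{\rho}{\sigma}J_\nu'(\sigma a)J_\nu(\rho a) - J_\nu(\sigma a)J_\nu'(\rho a)\right]F - \frac{(n^2 - 1)k^2}{\rho\sigma^2}\,\frac{\nu\beta}{\omega\epsilon_0 a}\,J_\nu(\sigma a)J_\nu(\rho a)\,G\right\} \tag{19}$$

$$K = \frac{\pi}{2}(\rho a)\left\{\frac{(n^2 - 1)k^2}{\rho\sigma^2}\,\frac{\nu\beta}{\omega\mu_0 a}\,J_\nu(\sigma a)N_\nu(\rho a)\,F + \left[J_\nu(\sigma a)N_\nu'(\rho a) - \frac{\rho}{\sigma}J_\nu'(\sigma a)N_\nu(\rho a)\right]G\right\} \tag{20}$$

$$M = \frac{\pi}{2}(\rho a)\left\{-\frac{(n^2 - 1)k^2}{\rho\sigma^2}\,\frac{\nu\beta}{\omega\mu_0 a}\,J_\nu(\sigma a)J_\nu(\rho a)\,F + \left[\frac{\rho}{\sigma}J_\nu'(\sigma a)J_\nu(\rho a) - J_\nu(\sigma a)J_\nu'(\rho a)\right]G\right\}. \tag{21}$$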

Equations (14), (16) and (18) through (21) are sufficient to satisfy
Maxwell’s equations and the boundary conditions. The coefficients $F$
and $G$ are, so far, completely arbitrary. We consider now two sets of
radiation modes. The first set is distinguished by using the coefficients
with subscripts $F_1$ and $G_1$ while the coefficients of the second set are
designated by $F_2$ and $G_2$. The two sets of coefficients must now be
adjusted to render the two sets of modes orthogonal. One of the infinitely
many solutions of this problem is


Hos ad, 
G, G, 
The ratio $F_1/G_1$ is now no longer arbitrary but is given by


(22) 





Fi _ (# Pte a ety 
Gy (# ‘) | st + (—n'd)’ +€ +f) (23a)
with
b = © Ji(ca)N,(o2) (23b) 
we o—* B 3,(ca)N (00) (230) 
d = * J{(oa)J,(0a) (23d) 
e = J,(ca)J{(pa) (23e) 
p= GPE J, 00) J.(o0) (238 
g = J,(ca)N{(pa). (23g) 


Equation (23) was already specialized to the mode of symmetry $\cos\phi$,
taking $\nu = 1$. The power carried by the radiation modes is given by


oe (5) 7 vel =O) (ts) g] + | ( — nd) + i(#2): ¢/ 


+ [e+ @— v(t) S[+ [r+ @- a(t)’ Cle. ew 


The normalization of the radiation modes involves the delta function in 
the same way as it did in the case of the slab waveguides. 


3.3 Radiation Losses Caused by a Step 


It has been shown previously® that the radiation losses of arbitrary 
deformations of dielectric waveguides can be calculated from the 
knowledge of the radiation loss of a step. For simplicity we limit the 
discussion to waveguide imperfections that do not violate the condition 
(2). Condition (2) restricts the waveguide deformations to symmetrical 
changes of the waveguide diameter. More general deformations are far 
more difficult to calculate. 

A step in the round dielectric rod is shown in Fig. 1. We restrict 
ourselves to a dominant mode waveguide. The radius of the larger part 
of the waveguide must be small enough to ensure that only the dominant 
mode of the structure can propagate. Waveguides with larger radii 
suffer conversion losses to other guided modes in addition to the radiation 
losses. Such losses have been studied for the case of the slab waveguide’ 
and for circular electric modes in round dielectric waveguides.” 

The radiation field can be expressed as an integral over all the radiation
modes. Indicating the modes by script letters with the superscript $i$
for the incident guided mode, $r$ for the reflected guided and radiation
modes and $t$ for the transmitted guided and radiation modes we can
write the boundary condition at the step as follows:


6 + a6 + [ alo) 8) + 2.(0) 82 ()] dp 

= 08? + [ (aloe) + rl 80) de 25) 
86° + a8? + | [ale 8(e) + plo) 82 (0) dp 

= 0.88 + f° ale) 850) + pile) 882(0)] dp 26) 
508 + 0,508” + [” fa,(e)ae'? (0) + px (o)8eS? (o)] do 

= 0908 + fo [auleraet(e) + plese dp 27) 
wes? + a,505" + |” [an(o)5es(e) + pe(o)8eS?(0)] dp 


= ose? + f ” Egela)se$?(p) + pila)3eS?(p)] dp. (28) 


These equations express the continuity of the transverse electric and
magnetic field components at the step. The field components that are
shown to be functions of $\rho$ belong to radiation modes while field components
that are not explicitly indicated as functions of $\rho$ belong to the
dominant guided mode. The amplitude of the incident guided mode is
unity. The approximate solution of the equation system (25) through
(28) follows the same reasoning that was presented for the case of the
slab waveguide.* The coefficient $c_t$ can be calculated by using the orthogonality
of the waveguide modes to the right of the step. The modes
to the right of the step are not orthogonal to the modes to the left of the
step because of the different waveguide size. It is thus not possible to
separate the coefficients $q_r$ and $p_r$ (which, incidentally, belong to the
two orthogonal sets of radiation modes) from the coefficient $a_r$ of the
reflected guided mode. This problem makes it impossible to obtain an
exact solution of the equation system. We neglect the reflected radiation
modes when we calculate the coefficient $c_t$. This approximation is
justified by the fact that for large steps the radiation favors the forward
direction so that $q_r$ and $p_r$ can be assumed to be small. For very small
steps where the ratio of forward to backward scattered power can be
expected to be more nearly unity we need not worry about the coefficients
of the reflected radiation modes since the modes of the two guide
sections become more nearly orthogonal to each other.

The transmission and reflection coefficients can thus be determined 
approximately with the result 


eels 
* (i, + L)P (29) 
and 
iP — i 
aa a ou 
with 
1 dj 
q, Ge a (6,A, a wuBy) wer As ae B2Bo) Ae 
1 1 (dss 1 (1) 
ie eas ee J i(k); (t720) — "3 J1(k:d_)H; (i722) 
Ky 1 Ky 


1 
— ee (B, A, —_ wpB,) (wer As = B2Bo) J 1 (ky G2) J 1 (kee) 
1 


+ 2 (weyn®B, A, Aa + wu B2B,Bs) 
KyKo(Ky _ Ka) 


[KS 1(k1d2) J o(keG2) — Kod o(k1G2) J 1 (Kode) | 


J, (k2Q2) 
H; ; (i722) 





l 
- Yo (weo8, A, Az + wpuBsB Be) 
2 
1 . : . , 
ig E 3 (it x0,) Ho? (eve) tay J 1(k,a,)Ho(ty2a1) 
Ky os Ye 


ae = [dod o(kyQ2) Hy” (27202) oy a, Jo(exts)H? éyeas)]) 
1 





_ _ dl) (i (Des (Ds 
sr me ie a HO (¢y,a,) tH,” (ty10;)Ho (t7201) 


cs in $"(énsaH2 rsa) |} (31) 




and with

The indices 1 and 2 attached to the coefficients and parameters indicate
that the corresponding quantities belong to the waveguide to the left
of the step (index 1) or to the right of the step (index 2). The coefficients
$A$ and $B$ are the amplitude coefficients introduced in equations (3), (10)
and (13). The factor $P$ in equation (29) is the power carried by the
incident guided mode. It was assumed that the power of all the modes is
identical. The actual power carried by the mode is accounted for by the
expansion coefficients $a_r$, $q_r$, $p_r$, $q_t$, $p_t$, and $c_t$. The power coefficients
appearing in equations (13) and (29) are also identical.

The theory of the dominant mode of the round dielectric waveguide 
is far more complex than the corresponding theory of the slab waveguide. 
This explains why the slab waveguide is so much more convenient to 
use for studying the general properties of radiation losses. 

The radiation loss caused by the step is obtained from

$$\frac{\Delta P}{P} = 1 - |c_t|^2 - |a_r|^2. \tag{33}$$

However, the same radiation loss can also be obtained by accounting
for the power carried away in the radiation modes. We can therefore
write also

$$\frac{\Delta P}{P} = \int_{-k}^{k}\left(|q|^2 + |p|^2\right)\frac{|\beta|}{\rho}\,d\beta. \tag{34}$$

The subscripts $r$ and $t$ have been dropped from the expansion coefficients
$p$ and $q$. Both reflected and transmitted radiation modes are automatically
included by extending the integration range from $-k$ to $k$ so that
backward as well as forward traveling waves are included. The factor
$|\beta|/\rho$ appearing under the integration sign arose from converting the
integration variable $\rho$ to $\beta$.

The theory becomes much simpler when we limit the derivation of 
the p and q coefficients to small steps. It was shown in the work on slab 
waveguides’ that arbitrary deformations of the waveguide wall can be 
treated as a succession of small steps. Even abrupt tapers can be described
this way. In the limit of small step height $\Delta a$ we can write

$$\Delta a = \frac{da}{dz}\,\Delta z. \tag{35}$$

The expansion coefficients $q_r$ and $q_t$ can approximately be obtained
from equations (25) through (28) by a method that has been explained
in some detail in Ref. 3.

$$q(\rho) = \int_0^L I(\rho, z)\,\frac{da}{dz}\,e^{-i(\beta_0 - \beta)z}\,dz. \tag{36}$$

The subscript $r$ or $t$ of $q$ is no longer necessary since $q_r$ corresponds to
negative values of $\beta$ while $q_t$ corresponds to positive values of $\beta$. The
derivation of $q$ has been simplified by expressing quantities pertaining
to the waveguide to the right of the step in terms of the corresponding
quantities for the waveguide to the left of the step. This approximation
involves an expansion of the field quantities in Taylor series keeping
only the first two terms of the expansion

$$F(a_2) = F(a_1) + \left(\frac{\partial F}{\partial a}\right)\Delta a. \tag{37}$$


The orthogonality of the modes belonging to the same section of wave- 
guide can be employed to eliminate many terms from the expressions. 
The resulting expression for $I(\rho, z)$ is far simpler than it would be had
we considered a large step. We obtain

The derivatives of the amplitude coefficients $H$, $I$, $K$, and $M$ of equations
(18) through (21) are taken by keeping $F$ and $G$ constant. The
reason for this prescription is the fact that the terms containing derivatives
of $F$ and $G$ disappear from the equations because of mode orthogonality.

Equation (36) holds for $q$ as well as for $p$. It is only necessary to insert
$F_1$ and $G_1$ in equations (38) through (42) to obtain the $q$ coefficients
while the $p$ coefficients are obtained by replacing $F_1$, $G_1$ with $F_2$, $G_2$.

In order to use equation (34) for the relative power loss caused by
radiation, it is necessary to calculate $q$ and $p$ with the help of equations
(36) and (38). The coefficients appearing in these equations must be
obtained from equations (39) through (42), and (10), (18), (22), (23),
and (24). It should be apparent that this theory is of considerable
complexity and can be handled only on an electronic computer. It is
sad that the dominant mode in a round dielectric waveguide leads to
such a complicated theory in comparison with the simple treatment of
the slab waveguide.


3.4 Random Wall Perturbations


An important source of loss is the radiation that is caused by small 
random perturbations of the waveguide wall. Such radiation losses have 
been discussed for slab waveguides in Ref. 1 and for round dielectric 
waveguides operating with the circular electric guided mode in Ref. 2. 
Equation (36) of our present analysis can be used to calculate the loss
of the dominant mode of the round waveguide caused by random wall 
perturbations. Since the step losses of the dominant mode of the round 
waveguide are so much higher than the corresponding losses of TE and 
TM modes of the slab waveguide one might fear that the losses caused 
by infinitesimal random perturbations of the waveguide wall may also 
be substantially higher. Fortunately, this is not the case. The losses 
caused by random wall perturbations are of the same order of magnitude 
for all types of dielectric waveguides that have been studied so far. 

The losses caused by random wall perturbations are calculated with
the help of a statistical model. Instead of using equation (34) for a
particular waveguide we form the ensemble average $\langle\Delta P/P\rangle$ over many
statistically similar systems. For very slight perturbations of the waveguide
wall we can assume that $I(\rho, z)$ is independent of the $z$ coordinate
and write equation (36), after a partial integration, in the form

$$q(\rho) = i(\beta_0 - \beta)\,I(\rho)\int_0^L a(z)\,e^{-i(\beta_0 - \beta)z}\,dz. \tag{43}$$

The argument $z$ has been dropped from $I(\rho)$ since this function is no
longer dependent on $z$. The partial integration had the beneficial effect
of causing $a(z)$ instead of its derivative to appear under the integration
sign. It was shown in Ref. 1 how substitution of equation (43) in (34)
makes the scattering loss dependent on the correlation function

$$R(u) = \langle a(z)\,a(z - u)\rangle \tag{44}$$

after the expectation value has been taken. It is, therefore, possible to
write the average value of the relative radiation loss as


(a) =o f Bo — BL Lo) P + | 1%) PF) 12] ig (45) 
with 
F(6) — [ Riu) cos (Bo — B)u du. (46) 


The superscripts 1 and 2 indicate that the function $I(\rho)$ has been computed
for both types of radiation modes that are associated with $F_1$, $G_1$
and $F_2$, $G_2$.

If we use for the correlation function a simple exponential function

$$R(u) = A^2\exp\left(-\frac{|u|}{B}\right) \tag{47}$$

$F(\beta)$ specializes to
F(6) = A (48) 


B| (bo — 8° + a 
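The cosine transform of the exponential correlation function can be checked numerically. The sketch below assumes that the integral over u is taken from zero to infinity (a symmetric range would simply double the result) and compares a trapezoidal estimate with the resulting Lorentzian form. The function names and parameter values are illustrative only.

```python
import math

def spectrum_numeric(delta_beta, rms, corr_length, steps=80000, span=40.0):
    """Trapezoidal estimate of int_0^(span*B) A^2 exp(-u/B) cos(delta_beta u) du."""
    h = span * corr_length / steps
    total = 0.5 * rms ** 2  # u = 0 endpoint; the far endpoint is negligible
    for j in range(1, steps):
        u = j * h
        total += rms ** 2 * math.exp(-u / corr_length) * math.cos(delta_beta * u)
    return total * h

def spectrum_closed_form(delta_beta, rms, corr_length):
    """Half-line result A^2 / (B * (delta_beta^2 + 1 / B^2))."""
    return rms ** 2 / (corr_length * (delta_beta ** 2 + 1.0 / corr_length ** 2))

numeric = spectrum_numeric(0.7, 1.0, 2.0)
closed = spectrum_closed_form(0.7, 1.0, 2.0)
```

The spectrum is largest when the difference of the propagation constants vanishes and falls off once that difference exceeds 1/B, which is why the correlation length controls which radiation modes are excited.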


IV. CONCLUSION 


We have found that the radiation losses of the dominant mode of a 
round dielectric waveguide are much higher than the corresponding 
losses of TE and TM modes of the slab waveguide. The radiation losses 
of the dominant mode of the round dielectric waveguide with an abrupt 
step have been verified by a millimeter wave experiment. The step 
losses of a ribbon waveguide were also measured and found to lie between 
the losses of the dominant mode of the round waveguide and the TE 
mode losses of the slab waveguide, but closer to the latter. It is thus 
apparent that the slab waveguide can tolerate abrupt steps exceptionally 
well. 

The radiation loss of a tapered round waveguide can be minimized 
by using a gentle taper instead of an abrupt step to accomplish the 
change of the waveguide radius. The losses of a linear taper are only 
slightly higher than the losses of a taper that was designed to equalize 
the loss contributions from different parts of the taper. It appears, 
therefore, that the design of optimum tapers is not profitable compared 
to their greater mechanical complexity. 

The losses caused by slight random wall imperfections are very similar
for the dominant mode and the circular electric TE01 mode of the round
dielectric rod as well as the TE and TM modes of the dielectric slab
waveguide. This result is surprising since the step losses of the dominant 
mode of the round waveguide are so much higher than the step losses 
of the slab waveguide. However, this result shows that the radiation 
losses caused by slight random wall perturbations can be studied with 
the help of the simple model of the slab waveguide and the results so 
obtained can be used to evaluate the performance of round dielectric 
waveguides.


Excitation of the Dominant Mode of a Round Fiber by a Gaussian Beam


By DIETRICH MARCUSE
(Manuscript received May 4, 1970) 


The excitation of the dominant HE11 mode of a round optical fiber by a
gaussian beam has been calculated. The calculation is based on the assumption
that reflected waves can be neglected. It is thus applicable only to fibers
with low index difference between core and cladding.

It is found that optimum excitation of the HE11 mode is achieved for
loosely guided beams if the product of the beam half-width $w$ times the
radial decay constant $\gamma$ of the HE11 mode outside of the guide is unity,
$\gamma w = 1$. For tightly coupled modes 2'w must be equal to the core radius in
order to achieve optimum excitation. As much as 99 percent of the power
can be transferred to the HE11 mode.

Also investigated are the effects of an off-set or tilted beam on the mode
excitation. The mode excitation drops to 86 percent if the amount of off-set
equals the beam half-width. The effect of tilts depends on the parameter $kd$,
free space propagation constant times core radius of the fiber. For small
values of $kd$ or loosely guided modes, the mode excitation is very sensitive
to tilts of the gaussian beam. As long as the HE11 mode is the only mode that
can propagate, increasing values of $kd$ lead to less sensitivity with respect
to tilts. For multimode operation of the fiber, the sensitivity to tilts increases
with increasing values of $kd$. The minimum of tilt sensitivity coincides with
the minimum spot size of the guided mode.


I. INTRODUCTION 


Communication by means of optical fibers requires that light energy 
can be coupled into the fiber in an efficient way. Of the different 
methods of exciting an optical fiber, the simplest consists of shining 
a beam of laser light on the end of the fiber. It is the purpose of 
this paper to investigate the power loss that results at the transition 
from a laser beam propagating in free space to the lowest order 
HE11 mode of a round optical fiber.

The geometry of the problem is sketched in Fig. 1. It is assumed
that the fiber core is embedded in an infinite material, its cladding.
For simplicity it is assumed that the value of the refractive index
outside of the core is unity. The theory is manageable only if reflections
from the end of the fiber are neglected. The transmission
coefficients are calculated by matching only the transverse component
of the electric or of the magnetic field at $z = 0$. Finally, an average of
these two values is taken.

The incident beam is assumed to have a field distribution of the 
form 


2 
i, = A exp | - (2) | exp (—tkz) for 220 (1) 
and 
e.\ 1/2 
H, = (=) E, (2) 
Mo 
with 
be =z eC as (3) 
0 


Since the field components of the fiber modes are conveniently expressed
in cylindrical polar coordinates r, φ and z, it is advantageous
to transform the incident field to these coordinates.

$$E_r = E_x \cos\phi; \qquad E_\phi = -E_x \sin\phi; \tag{4}$$

$$H_r = H_y \sin\phi; \qquad H_\phi = H_y \cos\phi. \tag{5}$$

Fig. 1—Gaussian beam incident on the core of a dielectric fiber with refractive
index n.

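The amplitude coefficient c_1 of the HE11 mode is approximately determined
by the equation

$$c_1 = \left(\frac{I_1 I_2}{4P^2}\right)^{1/2} \tag{6}$$

with

$$I_1 = \int\!\!\int \left(E_r \mathcal{H}_\phi^* - E_\phi \mathcal{H}_r^*\right) r \, dr \, d\phi \tag{7}$$

and

$$I_2 = \int\!\!\int \left(\mathcal{E}_r^* H_\phi - \mathcal{E}_\phi^* H_r\right) r \, dr \, d\phi. \tag{8}$$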


P is the power carried by the incident gaussian mode. The script
letters indicate the field components of the guided HE11 mode,² while
the other field components belong to the incident gaussian mode.

The r integrations must be carried out numerically while the φ integrations
can be done analytically even in the more complicated cases
of an off-set incident field distribution shown in Fig. 2 or a tilted incident
field distribution shown in Fig. 3.

The field components of the guided modes are described by cylinder
functions. The arguments of these functions inside of the fiber core at
r < d are κr with the radial propagation constant κ determined by

$$\kappa^2 = n^2 k^2 - \beta^2 \tag{9}$$

where β is the propagation constant of the guided mode in the z direction.
On the outside, r > d, the argument of the cylinder functions is γr
with

$$\gamma^2 = \beta^2 - k^2. \tag{10}$$

The decay constant γ determines the rate at which the field intensity
of the guided mode decays outside of the fiber core. For large
values of r the fields behave like

$$\exp(-\gamma r). \tag{11}$$

Equation (6) for the amplitude transmission coefficient is not exact.
It was derived under the assumption that reflections at z = 0 are
negligible. The power transmission coefficient T follows from

$$T = |c_1|^2. \tag{12}$$

Fig. 2—Fiber excited by a gaussian beam off-set with respect to the fiber
axis by an amount a.


II. NUMERICAL RESULTS 


We begin the discussion of the dependence of the transmission
coefficient T from the incident gaussian field to the guided HE11
mode with the simplest case shown in Fig. 1 for a refractive index
n = 1.01. The gaussian beam is perfectly aligned with its beam waist
being coincident with the end of the fiber core at z = 0. The transmission
coefficient as a function of the product γw is shown
in Fig. 4. Each curve belongs to a different value of kd.
The normalization of the curves with respect to the radial decay constant
γ is convenient since it compresses the dependence of the curve
on the horizontal axis. The position of the peaks would differ by two
orders of magnitude if the curves were drawn simply as functions
of w.

Fig. 3—Fiber excited by a tilted gaussian beam.

Fig. 4—Transmission coefficient T as a function of γw for several values of
kd and n = 1.01.

Two remarkable properties can be deduced from Fig. 4. The
transmission coefficient approaches extremely close to 100 percent.
The dependence of the transmission peaks as a function of kd is
shown in more detail in Fig. 5. According to this figure, the transmission
coefficient can be as high as 99.7 percent. These values are probably
slightly optimistic as we shall see shortly.

Fig. 5—The peak values of the transmission coefficient as a function of kd.

The position of the transmission peaks can be predicted for two 
regions of operation. For small values of kd the guided mode is 
only loosely supported by the fiber core. Most of the field is on the 
outside decaying according to equation (11). In this case the trans- 
mission curves peak at 


$$\gamma w = 1. \tag{13}$$

This means that the 1/e point of the exponential decay of the mode
field coincides with the corresponding point of the gaussian curve.
For λ = 1 μ and kd = 5, we have d = 0.8 μ so that for this example
1/γ = w = 38.1 μ; kd = 10 corresponds to 1/γ = w = 2.6 μ.

The HE11 mode is no longer the only possible guided mode for
large values of kd. At the value

$$kd = \frac{2.405}{(n^2 - 1)^{1/2}} \tag{14}$$

the TE01 mode begins to propagate. For n = 1.01, this point appears
for kd = 17. For tightly guided modes, most of the field energy is
concentrated inside of the fiber core. In this case, the peak of the
transmission coefficient occurs at

$$w = d/2^{1/2}. \tag{15}$$

For a very tightly guided mode, the propagation constant approaches
β = nk so that we obtain from equations (10) and (15)

$$\gamma w = (n^2 - 1)^{1/2} \, kd / 2^{1/2}. \tag{16}$$

For n = 1.01, we thus have γw = 0.1 kd. This relationship is indeed
apparent in Fig. 4.
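
The figures quoted in this passage can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original paper: it assumes the cutoff condition (14) in the form kd = 2.405/(n² − 1)^{1/2}, the tight-guidance relation (16) in the form γw = (n² − 1)^{1/2} kd/2^{1/2}, and the free-space relation k = 2π/λ. The function names are hypothetical.

```python
import math

def te01_cutoff_kd(n):
    """kd at which the TE01 mode starts to propagate (eq. 14), outside index 1."""
    return 2.405 / math.sqrt(n * n - 1.0)

def tight_guidance_slope(n):
    """Coefficient of kd in eq. (16), i.e. gamma*w = slope * kd."""
    return math.sqrt((n * n - 1.0) / 2.0)

def core_radius(kd, wavelength):
    """Core radius d for a given kd, assuming k = 2*pi/wavelength."""
    return kd * wavelength / (2.0 * math.pi)

if __name__ == "__main__":
    n = 1.01
    print("TE01 cutoff kd      :", round(te01_cutoff_kd(n), 2))        # about 17
    print("gamma*w / kd        :", round(tight_guidance_slope(n), 3))  # about 0.1
    print("d for kd=5, 1 micron:", round(core_radius(5.0, 1.0), 3))    # about 0.8
```

For n = 1.01 the three results reproduce the rounded values kd = 17, γw = 0.1 kd and d = 0.8 μ used in the text.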

For larger refractive indices of the core, our approximation becomes 
questionable. This breakdown of the approximation is apparent in 
Fig. 6 where n = 1.482. The curve with kd = 3 exceeds the value unity 
very slightly, violating the principle of conservation of power. This 
shows that our approximate values for T are slightly too large.
However, for small values of n — 1, it can be expected that the ap- 
proximation is good because back-scattering of power from the end 
of the fiber core becomes negligible. This expectation is confirmed 
by the fact that none of the curves in Fig. 4 exceeds the value unity. 
It is hard to predict the degree of accuracy of the approximation.
The values of Fig. 4 are perhaps slightly too high but it is clear that the
power transmission from the gaussian mode to the guided HE11 mode
is very efficient even if it does not quite reach 99.7 percent.

Fig. 6—Transmission coefficient T as a function of γw for n = 1.482. The curve
with kd = 3 exceeds T = 1 indicating a breakdown of the approximation.

Since perfect beam alignment cannot be achieved, it is important 
to know how sensitive the transmission coefficient is to misalignments 
of the beam. 

Fig. 7 shows data for the transmission coefficient T as a function
of the amount of off-set “a” of the gaussian beam shown in Fig. 2.
The independent variable of Fig. 7 is the product γa. Each curve
was drawn for its optimum value of γw according to Fig. 4. Fig. 7
shows that the transmission coefficient decreases to 0.36 if a = w.
This is a simple relationship that apparently holds for all values of 
kd. An off-set of the gaussian beam is thus not as critical as one might 
have feared. The direction in which the beam is off-set with respect 
to the polarization of the input field has been found to be unimportant. 
The same curves shown in Fig. 7 were obtained for any direction of 
the off-set. 
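
The value of roughly 0.36 at a = w is close to what a pure gaussian overlap would give. The sketch below is an illustration under a strong assumption that the paper does not make: the guided mode is replaced by a gaussian of the same width w as the input beam, so the result is only a plausibility check and not the paper's numerical method. The function name and the integration parameters are hypothetical.

```python
import math

def offset_transmission(a, w, n=4001, span=8.0):
    """Power coupling between two equal gaussians exp(-(r/w)^2) offset by a.

    The offset is taken along x. The y integrals are identical in the
    numerator and in both normalizations, so they cancel and a
    one-dimensional sum is enough.
    """
    lo = min(0.0, a) - span * w
    hi = max(0.0, a) + span * w
    dx = (hi - lo) / (n - 1)
    overlap = norm_beam = norm_mode = 0.0
    for j in range(n):
        x = lo + j * dx
        beam = math.exp(-((x - a) / w) ** 2)  # offset input beam
        mode = math.exp(-(x / w) ** 2)        # gaussian stand-in for the guided mode
        overlap += beam * mode * dx
        norm_beam += beam * beam * dx
        norm_mode += mode * mode * dx
    return overlap ** 2 / (norm_beam * norm_mode)

if __name__ == "__main__":
    # For equal gaussians the closed form is exp(-(a/w)^2), i.e. 1/e at a = w.
    print(round(offset_transmission(1.0, 1.0), 3))
```

Under this assumption the coupling at a = w is 1/e, about 0.37, independent of w, which agrees with the observation that the relationship holds for all values of kd.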

The dependence of the transmission coefficient on tilts of the input
field is shown in Fig. 8. Again w was chosen so that the maximum
transmission coefficient is obtained in the absence of a tilt. The trend
of these curves is interesting. The transmission coefficient is very
sensitive to tilts for small values of kd. This is not surprising since the
fields extend far from the fiber core so that a slight tilt causes the
two wavefronts of the input field and the guided mode to become
seriously misaligned. As the guided mode (and, since maximum transmission
is assumed, also the input field) contracts, the transmission
coefficient is far less sensitive to tilts. The least sensitive curve appears
for kd = 20 in Fig. 8. The next guided mode can be excited
by the input field as soon as kd exceeds the value 17. As more and
more guided modes appear, the transmission coefficient to the lowest
order mode, the HE11 mode, becomes more sensitive to tilts. The
best operating point as far as sensitivity to tilts is concerned is apparently
close to the point where the next guided mode begins to
propagate. This behavior can be explained as follows. If the wavelength
is kept constant and d is increased, the radial extension of the
field decreases at first for increasing values of d. However, as d increases
further, the field cross-section increases again. The least
sensitivity to tilts occurs at the minimum field cross-section.

Fig. 7—Peak transmission coefficient T as a function of beam off-set.

Fig. 8—Peak transmission coefficient as a function of tilt angle α.


III. CONCLUSIONS 


A numerical study of the excitation of the lowest order HE11 mode
of the round optical fiber by an incident gaussian mode showed that
the achievable transmission coefficient is very high. The predicted
optimum value of 99.7 percent may be slightly overoptimistic because
of the approximate nature of the calculation. However, Snyder
predicts transmission coefficients as high as 80 percent for the case
of excitation by a truncated plane wave. The gaussian beam is far
better matched to the HE11 mode so that a much higher transmission
coefficient is not surprising.

An off-set of the peak of the gaussian beam equal to its beam half-width
w decreases the transmission coefficient to 36 percent. Tilts of
the input field distribution are more serious for small values of the
ratio of fiber core radius to wavelength. The least tilt sensitivity is
obtained under conditions where the HE11 mode is operated close to
the cut-off frequency of the TE01 mode. The beam cross-section assumes
a minimum at this point.


The Capacity of the Gaussian
Channel with Feedback


By P. M. EBERT 
(Manuscript received April 28, 1970) 


In this paper we provide a rigorous proof that feedback cannot increase
the capacity of the channel with additive colored gaussian noise by more
than a factor of two. We also give a tighter bound showing that any increase
in capacity is less than the normalized correlation between the signal and
noise. It is further shown that gaussian signals and linear feedback processing
will achieve capacity.

The practical implications are that (i) feedback should be used to simplify
encoding and decoding since there is little to be gained in the way of increased
capacity and (ii) the various proposed schemes which use linear
feedback are doing the correct thing.


I. INTRODUCTION 


When Shannon first showed that feedback could not increase the 
capacity of a memoryless channel, he mentioned that the capacity 
could be increased when the channel had memory.’ One example of 
such a channel is the additive colored gaussian noise channel with an 
average power limitation on the transmitted signal. We prove here 
that the capacity of this channel is never more than twice the capacity 
without feedback and as the noise becomes white the capacity ap- 
proaches the forward capacity. The limiting case has been attributed to 
Shannon for years and has only recently been rigorously proven.” 

We derive an exact expression for the mutual information between 
the input and output of the channel. The application of different bounds 
to this expression produces twice the forward capacity with the weakest 
bound, or the forward capacity plus the normalized correlation of the 
signal and noise with a slightly stronger bound. It is shown that a 
gaussian signal maximizes the information, and consequently the opti- 
mum feedback technique is linear. 

Our results are based on the model shown in Fig. 1. The added noise
spectrum is normalized to 1 at infinite frequency, is bounded, and has
an integrable logarithm. This allows us to represent the noise as in
Fig. 2. The noise now consists of a white component plus a filtered
version of the white noise. The imposed restrictions are for mathematical
purposes only and are of no practical significance.

Fig. 1—Channel with noiseless feedback.


Theorem 1: The mutual information between the input and output
of a channel with additive gaussian noise with spectral density N(ω) and
arbitrary causal feedback processing, as shown in Fig. 1, is given by:

$$I(m; Y_T) = \tfrac{1}{2} E \int_0^T E^2[s(t) + z(t) \mid m, Y_t] \, dt - \tfrac{1}{2} E \int_0^T E^2[s(t) + z(t) \mid Y_t] \, dt \tag{1}$$

where Y_t is y(τ), 0 ≤ τ < t, and the expectations are conditioned on Y_t
or on Y_t and m. z(t) is a linear causal functional of white noise with the
properties that:

$$z(t) = \int_0^t h(t - \tau) \, dw(\tau) + \int_{-\infty}^0 h(t - \tau) \, dv(\tau), \qquad |1 + H(\omega)|^2 = N(\omega). \tag{2}$$

The two functions w(t) and v(t) are independent Wiener processes. The
reason for introducing the second term is to make n(t) = z(t) + w(t) a
stationary process.

Fig. 2—Model of nonwhite noise.

Proof: We first observe that w(t) + z(t) is equivalent to noise with
spectral density N(ω). A causal filter, h(τ), will exist whenever N(ω)
represents the square magnitude of a causal filter

$$|G(\omega)|^2 = N(\omega), \qquad H(\omega) = G(\omega) - 1.$$

The logarithm of G(ω) is

$$\tfrac{1}{2} \ln N(\omega) + iB(\omega)$$

where B(ω) is the phase characteristic of G(ω). The conditions of causality,
no lower half plane poles, will be met when B(ω) is one half the
Hilbert transform of ln N(ω). The conditions on N(ω) insure that
ln N(ω) has a Hilbert transform.

Now to prove formula (1) we use a theorem due to Kadota, Zakai
and Ziv’, which we state without proof:


Theorem A: The mutual information between the input parameter
m and the output processes Y of a finite power system disturbed by additive
white gaussian noise is

$$I(m; Y_T) = \tfrac{1}{2} E \int_0^T \phi^2(t, m, Y_t) \, dt - \tfrac{1}{2} E \int_0^T E^2[\phi(t, m, Y_t) \mid Y_t] \, dt,$$

where φ(t, m, Y_t) is the causal modulating function.

This result is applied to the non-white noise problem by considering
z(t) to be part of the signal. The inclusion is only useful when one is
calculating the mutual information; it is not to be included in the
calculation of transmitter power. Theorem A cannot be applied directly
since the signal, φ, which is taken as s(t) + z(t), is not completely determined
by m and Y_t, but is also a function of the process v(t). To
find I(m; Y_T) we use the decomposition,


$$I(m, V; Y_T) = I(m; Y_T) + I(V; Y_T \mid m), \tag{3}$$

where V is the process v(τ).
From Theorem A we have,

$$I(m, V; Y_T) = \tfrac{1}{2} E \int_0^T [s(t) + z(t)]^2 \, dt - \tfrac{1}{2} E \int_0^T E^2[s(t) + z(t) \mid Y_t] \, dt \tag{4}$$

and

$$I(V; Y_T \mid m) = \tfrac{1}{2} E \int_0^T [s(t) + z(t)]^2 \, dt - \tfrac{1}{2} E \int_0^T E^2[s(t) + z(t) \mid Y_t, m] \, dt,$$

which together with equation (3) proves Theorem 1. s(t) + z(t) has
finite energy because s(t) must have finite energy and z(t) will have
finite energy whenever the channel has finite capacity without feedback,
as we shall see when we evaluate E[z²(t)]. With this basic result
we can derive several interesting corollaries concerning the information.
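
The step that combines the decomposition (3) with equation (4) can be written out explicitly. The lines below are a sketch of that algebra in the notation of Theorem 1, writing φ for s(t) + z(t); they add no assumption beyond the two expressions supplied by Theorem A. Solving (3) for I(m; Y_T), the two energy terms cancel and only the conditional-mean terms survive:

```latex
\begin{align*}
I(m; Y_T) &= I(m, V; Y_T) - I(V; Y_T \mid m) \\
 &= \Bigl[\tfrac12 E\!\int_0^T \phi^2\,dt - \tfrac12 E\!\int_0^T E^2[\phi \mid Y_t]\,dt\Bigr]
  - \Bigl[\tfrac12 E\!\int_0^T \phi^2\,dt - \tfrac12 E\!\int_0^T E^2[\phi \mid Y_t, m]\,dt\Bigr] \\
 &= \tfrac12 E\!\int_0^T E^2[\phi \mid m, Y_t]\,dt - \tfrac12 E\!\int_0^T E^2[\phi \mid Y_t]\,dt,
 \qquad \phi = s(t) + z(t).
\end{align*}
```

The last line is the statement of Theorem 1.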


Corollary 1: (Pinsker)* Under the conditions of Theorem 1,

$$\frac{I(m; Y_T)}{T} < 2C$$

where C is the capacity of the channel without feedback.
First we observe by equation (3) that

$$I(m; Y_T) \le I(m, V; Y_T)$$

which is given by equation (4). Furthermore the second term in equation
(4) is negative and can be ignored, thus

$$I(m; Y_T) \le \tfrac{1}{2} E \int_0^T (s + z)^2 \, dt. \tag{5}$$

I(m; Y_T) can be further bounded by

$$I(m; Y_T) \le E \int_0^T s^2 \, dt + E \int_0^T z^2 \, dt \tag{6}$$

since (s + z)² ≤ 2s² + 2z².
The next step is to calculate the variance of z, since this enters directly
into I(m; Y_T).


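$$E \int_0^T z^2(t) \, dt = T \, E(z^2),$$

$$E(z^2) = \frac{1}{2\pi} \int |H(\omega)|^2 \, d\omega = \frac{1}{2\pi} \int |G(\omega) - 1|^2 \, d\omega$$

$$= \frac{1}{2\pi} \int \left| \exp\left[\tfrac{1}{2} \ln N(\omega) + \tfrac{i}{2} \ln N(\omega)^{\dagger}\right] - 1 \right|^2 d\omega$$

$$= -\frac{1}{2\pi} \int [1 - N(\omega)] \, d\omega - \frac{1}{\pi} \, \mathrm{Re} \int \left\{ \exp\left[\tfrac{1}{2} \ln N(\omega) + \tfrac{i}{2} \ln N(\omega)^{\dagger}\right] - 1 \right\} d\omega.$$

* The factor of 2 has been mentioned earlier by Pinsker but no proof has yet
been published.
† Indicates the Hilbert transform.
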


This latter integral, as chance would have it, is almost identical in 
structure to an integral which arises in evaluating the spectral density 
of a single sideband FM wave (at the carrier frequency) which is 
modulated by a gaussian signal. The quantity ½ ln N(ω) here plays
the role of the autocorrelation function of the gaussian signal, and
although for our problem ½ ln N(ω) is not in general an autocorrelation
function, the integral may be discussed via the technique used
in the FM problem (see Mazo and Salz)’. 
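Define:

$$F(\omega) = \tfrac{1}{2} \ln N(\omega) + \tfrac{i}{2} \ln N(\omega)^{\dagger}, \qquad f(\tau) = \frac{1}{2\pi} \int F(\omega) \, e^{i\omega\tau} \, d\omega,$$

then

$$\frac{d}{d\omega} [G(\omega) - 1] = G(\omega) \frac{d}{d\omega} F(\omega) = [G(\omega) - 1] \frac{dF(\omega)}{d\omega} + \frac{dF(\omega)}{d\omega}.$$

In the time domain this becomes

$$\tau h(\tau) = \int_0^{\tau} h(\tau - x) \, x f(x) \, dx + \tau f(\tau)$$

because both h(τ) and f(τ) are zero for negative τ. Both f(τ) and h(τ)
are finite for small τ and thus

$$h(\tau = 0) = f(\tau = 0).$$

The integral we are interested in is 2 Re h(τ = 0) which is equal to

$$2 \, \mathrm{Re} \, h(\tau = 0) = \frac{1}{2\pi} \int \ln N(\omega) \, d\omega.$$

Thus far we have shown that

$$E \int_0^T s^2 \, dt + E \int_0^T z^2 \, dt = E \int_0^T s^2 \, dt - \frac{T}{2\pi} \int [1 - N(\omega)] \, d\omega - \frac{T}{2\pi} \int \ln N(\omega) \, d\omega. \tag{7}$$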


One more trick is needed to prove the corollary. We have, up to this
point, considered only normalized channels which had N(∞) = 1.
This is valid because normalization cannot affect the ratio of the
capacity without feedback to that with feedback. Some channels cannot
be normalized in this manner, i.e., N(∞) = ∞ or N(∞) = 0.
The latter case has infinite capacity and thus the corollary applies.
The former presents no problems due to the following lemma.


Lemma: Consider the channel without feedback. By the water pouring
argument® we know that the signal energy which achieves capacity obeys:

$$S(\omega) = \begin{cases} K - N(\omega), & N(\omega) \le K; \\ 0, & \text{otherwise.} \end{cases}$$

If we define a new noise N⁰(ω)

$$N^0(\omega) = \begin{cases} N(\omega), & N(\omega) \le K; \\ K, & N(\omega) > K, \end{cases}$$

this new channel has the same capacity without feedback and a larger
capacity with feedback.

Proof: The expression for capacity without feedback is the same
for N(ω) and N⁰(ω). The capacity with feedback can only be increased
since N⁰(ω) ≤ N(ω) for all ω. For if the capacity with N(ω) were larger,
one could add a noise with spectrum N(ω) − N⁰(ω) at the receiver
and do just as well as if the noise were N(ω).

We now normalize the noise, N⁰(ω), in order to apply equation (6),
which makes K = 1. The capacity without feedback is:

$$C = \frac{1}{4\pi} \int \ln \frac{1}{N^0(\omega)} \, d\omega, \qquad P = \frac{1}{2\pi} \int [1 - N^0(\omega)] \, d\omega. \tag{8}$$

With feedback from equations (6), (7) and (8)

$$T C_{FB} \le E \int_0^T s^2 \, dt - TP + 2TC$$

or

$$C_{FB} \le 2C.$$
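
Equations (7) and (8) together imply E(z²) = −P + 2C for a normalized spectrum, and this can be checked numerically on a simple case. The sketch below is an illustration only. The spectrum N(ω) = (ω² + a²)/(ω² + b²) with a < b is a hypothetical example chosen here, not one taken from the paper; for it the causal factor is G(ω) = (iω + a)/(iω + b), so h(t) = (a − b)e^{−bt} and the energy of h is (a − b)²/(2b). C is evaluated in the two-sided form (1/4π)∫ ln(1/N) dω, in nats per second.

```python
import math

def integrate_real_line(f, n=20000):
    """Integral of f over the real line via w = tan(theta) and the midpoint rule."""
    step = math.pi / n
    total = 0.0
    for j in range(n):
        theta = -math.pi / 2.0 + (j + 0.5) * step
        total += f(math.tan(theta)) / math.cos(theta) ** 2
    return total * step

def check_example(a=1.0, b=2.0):
    """Return (P, C, -P + 2C, energy of h) for N(w) = (w^2 + a^2)/(w^2 + b^2)."""
    def noise(w):
        return (w * w + a * a) / (w * w + b * b)

    power = integrate_real_line(lambda w: 1.0 - noise(w)) / (2.0 * math.pi)
    capacity = integrate_real_line(lambda w: -math.log(noise(w))) / (4.0 * math.pi)
    z_energy = -power + 2.0 * capacity          # right-hand side of (7), per unit time
    h_energy = (a - b) ** 2 / (2.0 * b)         # direct integral of h(t)^2
    return power, capacity, z_energy, h_energy

if __name__ == "__main__":
    P, C, z_energy, h_energy = check_example()
    print("P =", round(P, 4), " C =", round(C, 4))
    print("-P + 2C =", round(z_energy, 4), " energy of h =", round(h_energy, 4))
```

For a = 1 and b = 2 the sketch gives P = 0.75 and C = 0.5, and both routes to E(z²) give 0.25, so the two sides of the relation agree for this example.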
A tighter bound can be obtained by returning to equation (5) and
writing:

$$I(m; Y_T) \le \tfrac{1}{2} E \int_0^T s^2 \, dt + \tfrac{1}{2} E \int_0^T (z^0)^2 \, dt + E \int_0^T s z^0 \, dt$$

which by the preceding argument is equal to

$$TC + E \int_0^T s n^0 \, dt.$$

The correlation E s z⁰ is equal to E s n⁰ because n⁰ and z⁰ only differ by a
white component. Thus the capacity can be increased only by the
correlation of the signal with the noise. The noise n⁰ is not the original
noise; however, the difference occurs only at frequencies not used for
signaling without feedback. As N(ω) becomes white, the energy in z⁰
decreases and consequently E s z⁰ must go to zero.

More insight into the problem is supplied by the following theorem. 


Theorem 2: Capacity can be attained with a gaussian signal s(t).

Proof: First we observe that

$$E[s(t) + z(t) \mid m, Y_t] = s(t, m, Y_t) + E[z(t) \mid W_t].$$

This is true because s(t) is known given m and Y_t, and z(t) is dependent
on W_t which can be calculated given Y_t and s(t). E[z(t) | W_t] is a linear
functional of w because w is gaussian.

$$E[z(t) \mid W_t] = \int_0^t h(t - \tau) \, dw(\tau).$$

The first term in equation (1) depends only on the correlation properties
of s(t, m, Y_t) and w(τ) and therefore we can use a gaussian s of
the appropriate correlation. For the second term we use the property
that a least-squares linear estimate has no more energy than the more
general least-squares estimate.

$$E x^2 = E \hat{x}^2 + E(x - \hat{x})^2 = E \tilde{x}^2 + E(x - \tilde{x})^2$$

where x̃ is the least-squares linear estimate of x and x̂ is the least-squares
estimate. Since

$$E(x - \hat{x})^2 \le E(x - \tilde{x})^2, \qquad E \tilde{x}^2 \le E \hat{x}^2.$$

Therefore, since E[s(t) + z(t) | Y_t] is the least-squares estimate of
s(t) + z(t) given Y_t we have

$$I(m; Y_T) \le \tfrac{1}{2} E \int_0^T E^2[s + z \mid m, Y_t] \, dt - \tfrac{1}{2} E \int_0^T \widetilde{(s + z)}^2 \, dt$$

but for a gaussian signal this inequality is an equality. In addition the
signal power is unchanged and the feedback processor need only be
linear. Therefore one need consider only gaussian input and linear
processing in calculating capacity.


II. GENERALITY OF THE MODEL 


The restrictions on N(ω) are in fact only needed for N⁰(ω). If a
noise spectrum is such that the logarithmic integral of N⁰(ω) is minus
infinity then the capacity of the channel is infinite without feedback.
Therefore the bound applies to any channel which has a finite capacity
without feedback.

The bounds are all valid for noisy feedback as well; however, it is not
clear that gaussian signals are optimum in that case.


New Theorems on the Equations of
Nonlinear DC Transistor Networks


By ALAN N. WILLSON, JR. 


(Manuscript received March 26, 1970) 


It has long been recognized that equations describing dc transistor networks
do not necessarily have unique solutions. The Eccles-Jordan (flip-flop)
circuit is an excellent example of one for which the dc equations may
have more than one solution.

Only recently, however, has a comprehensive theory concerning matters
such as the existence and uniqueness of solutions of the dc equations of
general transistor networks begun to take shape. This paper represents
another contribution to the evolution of that theory.

A key concept in the development of the recent theory is the concept of a
“P₀ matrix.” We give a generalization of that concept, showing that one
can specify properties possessed by certain pairs of square matrices, analogous
to the properties possessed by a single P₀ matrix. Pairs of matrices
possessing these properties are called W₀ pairs. Use is made of this W₀ pair
concept to prove results which are more general than some of the existing
ones. We provide an extension of much of the existing theory in such a
manner that a broader class of dc transistor networks may be considered.
In particular, the new results provide one with the ability to answer certain
questions concerning the existence, uniqueness, boundedness, and so on,
of solutions of the equations for any network which is comprised of transistors,
diodes, resistors, and independent sources.


I. INTRODUCTION 


Suppose a network is constructed by connecting in an arbitrary 
manner any number of transistors, diodes, resistors, and independent 
voltage and current sources. Without loss of generality, we may consider 
the network to have the canonical form shown in Fig. 1; that is, we may
consider the network to be a multiport containing resistors and independent
sources, with transistors and diodes connected to the ports.*

* It will become apparent that the theory can also accommodate many other structures
which are of the Fig. 1 type except that the multiport contains additional
linear elements (such as controlled sources). We do not stress this point though,
since in the present context such elements seem somewhat unnatural.

Fig. 1—Canonical form of a transistor network.

There are some fundamental questions that one should then, hopefully,
be able to answer. For example: Do the equations that describe this dc
network have a unique solution? With the exception of certain uniqueness
results for a special (but nonetheless important) class of transistor
networks, all of the previous explicit results in Refs. 1, 2, and 3, which
have shown methods for obtaining answers to such questions, have
been concerned only with the class of transistor networks for which,
after setting the value of each independent source to zero, there exists a
short-circuit admittance matrix (a G matrix) to characterize the linear
multiport of Fig. 1. It is the primary purpose of this paper to show how
that restriction can be removed. We shall show in fact that almost all 
of the previous results are but special cases of results that follow from 
a more general theory in which the assumption of the existence of a G 
matrix for the linear multiport is unnecessary.*

* Pragmatists might argue that in any “physical” network, there will always be
enough “stray” resistance present which, if taken into account, will guarantee the
existence of, say, a G matrix. It seems to this writer, however, that by taking such a
point of view, one does not obtain an entirely satisfactory understanding of matters
(even practical matters). To know that fundamental results do not depend (if, in
fact, they don’t) upon such fortunate occurrences as these (and for many transistor
networks this is the case) seems to be the more satisfactory situation. Furthermore,
it should be noted that in the analysis of a physical network, to obtain a tractable
problem, it often behooves one to neglect the presence of unimportant elements.
Thus, it is not necessarily true that such stray resistors will always be present in the
model of the network which the analyst desires to consider.

Section II concerns methods for characterizing a general multiport
containing resistors and independent sources. In Section III, we consider
the model for a transistor. An equation for dc transistor networks is
then developed in Section IV and, after explaining some notation in
Section V, we develop the W₀ pair concept in Section VI. Sections VII,
VIII and IX show how the W₀ pair concept provides a generalization
of the existing results concerning dc transistor networks. Finally, we
consider an example network in Section X.


II. LINEAR MULTIPORT CHARACTERIZATION


A multiport having n ports (an n-port) is characterized by determining
every combination of the 2n port voltages and currents that the network
admits (see Ref. 4). We discuss here two methods of characterizing
multiports that contain resistors and independent sources. The first
method makes use of the familiar concept of a hybrid matrix. The second
method uses a pair of matrices in a manner that was apparently first
suggested, for multiports containing no independent sources, by
V. Belevitch.⁵


2.1 The Hybrid Formalism


When the value of each independent source is set to zero, for a multiport
containing only resistors and independent sources, the multiport
becomes, of course, a resistive multiport. H. C. So has proved (as a special
case of a theorem in Ref. 6) that any resistive multiport has a hybrid
matrix description. That is, for any resistive n-port, it is always possible
to label the port voltage and current variables in such a way that there
exists an integer m, 0 ≤ m ≤ n, a pair of n-vectors*

$$x = (v_1, \ldots, v_m, i_{m+1}, \ldots, i_n)^T,$$

$$y = (i_1, \ldots, i_m, v_{m+1}, \ldots, v_n)^T,$$

and a real n × n matrix H, the hybrid matrix, such that the network
admits the port variables v_k, i_k as the voltage and current, respectively,
at the kth port, for k = 1, ..., n, if and only if the vectors x and y satisfy


y = Hx. (1)


Thus, a resistive multiport may always be characterized by a hybrid 
matrix. 

When independent sources whose values are nonzero are present 
in an otherwise resistive multiport, a hybrid matrix will not generally 
suffice to characterize the multiport. Clearly the vectors x = y =
(0, 0, ..., 0)^T, which satisfy equation (1) for any matrix H, do not always
specify an admissible combination of port variables when independent
sources are present. One might hope, however, that a characterization
of the type


y = Hx + c, (2)


where c is some constant vector (whose elements are real numbers), 
might always be possible. Indeed, we are about to show that this is the 
case. There is one problem, however, that was not present in the con- 
sideration of resistive n-ports that must first be dealt with: there are ways 
to interconnect independent sources and resistors such that the resulting 
structure doesn’t make sense. That is, the independent sources might 
impose self-contradictory constraints on the network. We rule out such 
possibilities by agreeing that, when we refer to “a multiport containing 
resistors and independent sources,” we always assume that the multiport 
possesses the following property: 


Assumption: The linear graph that is formed by associating an edge 
with each resistor, each independent source, and each port, has no 
cut-sets containing only current source edges for which the values of 
the current sources cause a violation of Kirchhoff’s current law. Similarly, 
no circuits of voltage source edges for which the values of the voltage 
sources cause a violation of Kirchhoff’s voltage law are present. 


This assumption in no way restricts the generality of our work. We
are simply ruling out multiports, like the 2-port of Fig. 2, for which the
set of admissible port voltage and current combinations is empty.

* We use the superscript T to denote the transpose of a vector or a matrix. Thus,
the vectors x and y above are both column vectors.

We have worded the Assumption so that the presence of, say, a series
connection of two 1-ampere current sources in an otherwise resistive
multiport does not cause the multiport to be inadmissible. We have 
done this because no violation of Kirchhoff’s laws results from such 
interconnections of resistors and sources; the network is perfectly legiti- 
mate. One should be aware, however, that if ‘‘superfluous” sources are 
present in a network, it will follow that one cannot uniquely determine 
the value of each branch voltage and current in the network. That is, 
even though one might be able to uniquely determine the value of the 
voltage across the pair of 1-ampere sources, there is no way to determine 
the value of the voltage across each individual source. Aside from such 
ambiguities, it follows (see below and the proof of Theorem 1 in Ref. 6) 
that the value of all branch voltages and currents can be uniquely deter- 
mined for a multiport satisfying the Assumption, whenever the values 
of the “independent” port variables are known. 


Theorem: Any multiport containing resistors and independent sources 
can be characterized by equation (2), where H is a hybrid matrix charac-
terization of the corresponding resistive multiport that is obtained by setting 
all independent source values to zero, and c is a vector of real numbers. 


A proof of this theorem can be constructed by incorporating a few 
simple observations and minor modifications into the arguments used 
by So in Ref. 6. We therefore simply sketch the main ideas: First, if the 
linear graph mentioned in the Assumption contains any current source 
cut-sets, then it must be the case (because of that Assumption) that 
these sources have values such that Kirchhoff’s current law is satisfied. 
That being the case, the port behavior of the multiport will clearly be 
unaltered if a sufficient number of current sources are removed (by 
coalescing appropriate nodes) to eliminate such cut-sets. A similar 
observation applies to voltage source circuits. Therefore without any 
loss of generality, we may consider the linear graph to have no current 
source cut-sets and no voltage source circuits. Next, by Lemmas 1 and 
2 of Ref. 6, it then follows that there exists a tree* for the linear graph
for which all voltage source edges are branches and all current source
edges are links. At each port, one of the two port variables is then designated
as “independent,” the choice depending upon whether the edge
corresponding to that port is a branch or a link. The existence of the
hybrid matrix H and the vector c for the characterization (2) then follows
in the same manner as the existence of a hybrid matrix for a resistive
multiport follows from So’s arguments.

* In case the linear graph is not connected each reference to the word tree should,
of course, be changed to forest.

Fig. 2—An inadmissible n-port.


2.2 Belevitch’s Formalism 


For some multiports, it might be that (after setting all independent 
source values to zero) a hybrid matrix exists such that the vectors x 
and y in equation (1) satisfy $x = v = (v_1, \ldots, v_n)^T$ and
$y = i = (i_1, \ldots, i_n)^T$. In this case the hybrid matrix is given the
special name, admittance matrix. Similarly, if it happens that H exists such
that $x = i$ and $y = v$, then H is called the impedance matrix. For many
resistive multiports, neither an impedance matrix nor an admittance matrix
exists. It is still possible, however, to characterize any n-port for which
a hybrid matrix exists in terms of the vectors v and i. Obviously, x and y
satisfy equation (1) if and only if v and i satisfy


$$[\,I_a \;\vdots\; -H_b\,]\,v = [\,H_a \;\vdots\; -I_b\,]\,i, \tag{3}$$


where the n × m matrix $H_a$ and the n × (n − m) matrix $H_b$ are defined
by $H = [H_a \;\vdots\; H_b]$, and similarly $[I_a \;\vdots\; I_b]$ is the n × n identity matrix.

The characterization (3), being equivalent to equation (1), is perfectly 
adequate for any resistive n-port. It is, however, but a special case of a 
more general characterization due to Belevitch, namely: 


Pv = Qi, (4) 


where P and Q are n X n real matrices. Belevitch’s characterization 
can be used for quite a broad class of networks, including some rather 
pathological ones which require dependent sources, or gyrators and 
negative resistors to realize, and for which no hybrid characterization 
exists. For example, the one-port called a norator, for which the set of 
admissible port voltage and current combinations is the set of all pairs of 
real numbers, may be characterized by [0]v = [0]i. We should note,
however, that if one allows the aforementioned elements to be present 
in an n-port, then even equation (4) cannot always provide a
characterization. The nullator, for example, a one-port whose only admissible
combination of port voltage and current variables is the pair (0, 0), is 
such an n-port. 

When an n-port contains independent sources it can often be
characterized by the equation

$$Pv = Qi + c, \tag{5}$$

where P and Q are real n × n matrices, and c is a constant vector. Clearly,
any n-port containing only resistors and independent sources has such a
characterization. It is this class of n-ports which is our primary concern
in the study of transistor networks. We note, however, that equation
(5) is adequate for characterizing a much broader class of n-ports. 


III. NONLINEAR TRANSISTOR CHARACTERIZATION 


In Fig. 3, a commonly used large signal dc transistor model is
displayed. It is easily verified that the voltage and current variables defined
in that figure obey the following relationships:

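$$\begin{bmatrix} i_1 \\ i_2 \end{bmatrix} = \begin{bmatrix} 1 & -\alpha_r \\ -\alpha_f & 1 \end{bmatrix} \begin{bmatrix} f_1(v_1) \\ f_2(v_2) \end{bmatrix}. \tag{6}$$

Fig. 3. Large signal dc transistor model.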
Each of the parameters $\alpha_f$ and $\alpha_r$ may assume any value in the open
interval (0, 1). The parameters $r_e$, $r_b$, and $r_c$, which account for lead
resistances, are sometimes omitted by device modelers (their presence
is sometimes accounted for by including appropriate additional resistors
in the network to which the transistor model is connected). To
accommodate these various points of view we specify only, therefore, that the
values of the parameters $r_e$, $r_b$, and $r_c$ be nonnegative. Thus any or all
of them may be zero.

Depending upon whether the transistor being modeled is a pnp or 
an npn, the graph of each of the functions $f_1$ and $f_2$ has one of the general
shapes shown in Fig. 4 (at least for values of |v| that are "not too large").
Often these functions are described by an equation of the form


$$f_k(v) = m_k[\exp(n_k v) - 1], \quad k = 1, 2, \tag{8}$$


where $m_k$ and $n_k$ are appropriately chosen constants, both being positive
for a pnp transistor, and both negative for an npn. On the other hand,
for example, a piecewise-linear representation is sometimes specified
for $f_1$ and $f_2$.

The nature of the functions $f_1$ and $f_2$ for large values of |v| depends
upon which assumptions the modeler is willing to make, and which
effects he is interested in considering. For large negative (in the pnp
case) values of v, for example, the graph of $f_k$ approaches, according
to equation (8), the horizontal asymptote $i = -m_k$. Thus, if the
modeler chooses to use equation (8) to describe $f_k$ for all values of v,
the range of $f_k$ will not be the entire real line. If, on the other hand, the
effect of ohmic surface leakage across the p-n junction is included in
the model, the graph of the function $f_k$ will approach asymptotically
a straight line having a small, but positive, slope. The range of such
a function is, obviously, the whole real line. One might also wish to
include the effect of avalanche breakdown in the reverse-biased region.
If this is done, the graph of $f_k$ will have a shape reminiscent of that of
a Zener diode in the v < 0 part of its domain.
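The behavior of the exponential form (8) described above can be checked numerically. The following is a minimal sketch only: the constants `m_k = 1e-9` and `n_k = 40.0` are hypothetical pnp-style values chosen for illustration, and the helper name `make_exponential_f` is not from the paper.

```python
import math

def make_exponential_f(m_k, n_k):
    """Return f(v) = m_k * (exp(n_k * v) - 1), the form of equation (8)."""
    def f(v):
        # expm1 keeps precision for small |n_k * v|
        return m_k * math.expm1(n_k * v)
    return f

# Hypothetical constants (both positive, i.e. the pnp case); illustrative only.
m_k, n_k = 1e-9, 40.0
f = make_exponential_f(m_k, n_k)

samples = [-2.0, -0.5, 0.0, 0.3, 0.6]
values = [f(v) for v in samples]

# The sampled values increase with v, as strict monotonicity requires.
is_strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

# For large negative v the value approaches the asymptote -m_k from above,
# so the range of this f is not the whole real line.
lower_bound_respected = f(-50.0) >= -m_k
```

With this form alone the function is strictly increasing but not onto, which is why the text distinguishes "into" from "onto" mappings.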

In the forward-biased region there are also effects, particularly apparent 
for large values of v, which the modeler may or may not wish to recognize. 
For example, there is the so-called high-level injection phenomenon 
which tends to decrease the value of the forward current and which, 
using equation (8), is usually accounted for by a decrease in the
magnitude of $n_k$ for large values of v. In addition, there is the effect of the ohmic
resistance of the crystal which tends to reduce the value of forward 
current for large values of v. 

From the point of view of the device modeler, the question of whether 
or not to include some of the effects mentioned above is often a minor
issue. For many networks the behavior will be essentially the same
whether or not, say, surface leakage is accounted for in the transistor
model. From the point of view of the network analyst, however, the
situation is somewhat different. For example, the matter of whether
or not the functions $f_k$ map the real line onto the real line can, in some
cases, make the difference between whether or not there exists a solution
of the network's equations. Similarly, other results that have been
obtained recently (presented later, beginning in Section VII) also seem
to depend upon the graphs of the functions $f_k$ having certain special
properties.

It seems safe to say that no matter which "special effects" are included
(or omitted) in the description of the transistor, the functions $f_k$ may
at least be considered to be strictly monotone increasing mappings of 
the real line into itself. For the purpose of formulating the equations 
for transistor networks, this is the only hypothesis that we shall make. 
When additional hypotheses regarding the nature of these functions 
are needed (to obtain certain results concerning properties of these 
equations) those hypotheses will be mentioned explicitly. In each case 
it will be clear that the additional hypotheses are, in some appropriate 
sense, rather weak. 

Similar remarks can be made for the diodes that are shown in Fig. 1, 
which might also be present in transistor networks. Thus, we assume 
that each diode is described by an equation of the type i = f(v) where,
at this point, we only assume that the function f is a strictly monotone 
increasing mapping of the real line into itself. 


IV. EQUATIONS FOR TRANSISTOR NETWORKS 


Suppose we are given a dc network consisting of transistors, diodes,
resistors, and independent voltage and current sources, connected
together in an arbitrary manner. Let there be n transistors and d diodes.
Clearly, there is no loss of generality if we consider the network to be of 
the type shown in Fig. 1. Using the results of Section III, we may describe 
the nonlinear devices in the network by the equations 


$$y = TF(x), \qquad \hat{x} = x + Ry, \tag{9}$$


where $T = \mathrm{diag}[T_1, I]$, with $T_1$ a block diagonal matrix with n 2 × 2
diagonal blocks of the form


$$\begin{bmatrix} 1 & -\alpha_r^{(k)} \\ -\alpha_f^{(k)} & 1 \end{bmatrix} \quad \text{for } k = 1, \ldots, n, \tag{10}$$


and I the d × d identity matrix. Also, $R = \mathrm{diag}[R_1, R_2]$, with $R_1$ a
block diagonal matrix with n 2 × 2 diagonal blocks of the form


$$\begin{bmatrix} r_e^{(k)} + r_b^{(k)} & r_b^{(k)} \\ r_b^{(k)} & r_c^{(k)} + r_b^{(k)} \end{bmatrix}, \quad k = 1, \ldots, n, \tag{11}$$


and $R_2$ the d × d matrix whose elements are all zeros. The function F
has the form $F(x) = [f_1(x_1), \ldots, f_{2n+d}(x_{2n+d})]^T$, where each of the $f_k$ is a
strictly monotone increasing mapping of the real line into itself.

Fig. 4. General shape of the functions $f_1$ and $f_2$: (a) pnp transistor, (b) npn transistor.

Using the results of Section II, the effect of the linear multiport in 
Fig. 1 is to constrain the vectors of port variables, $\hat{x}$ and y, to obey the
relationship


$$P\hat{x} = -Qy + c, \tag{12}$$


where P and Q are (2n + d) X (2n + d) real matrices and c is a real 
(2n + d)-vector. The minus sign appears in equation (12) as a consequence 
of having chosen the reference direction for the port currents (the elements 
of the vector y) to be opposite to that which is usually assumed. 

By using equations (9), we may easily eliminate the variables $\hat{x}$ and
y from equation (12), resulting in the equation


(PR + Q)TF(x) + Px =c. (13) 


The central problem in determining the values of all branch voltages 
and currents in a de transistor network is the determination of a solution 
of equation (13). The rest is relatively straightforward, for if x is a (unique)
solution of equation (13), then the (unique) vectors $\hat{x}$ and y, such that
equations (9) and (12) are satisfied, may immediately be computed 
from equations (9). 
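The elimination that leads to equation (13) can be written out in three lines. This is a worked sketch, assuming that the second of equations (9) reads $\hat{x} = x + Ry$ (the sign that is consistent with equation (13)) and writing $\hat{x}$ for the port-voltage vector that appears in equation (12):

```latex
\begin{aligned}
P\hat{x} &= -Qy + c, \qquad \hat{x} = x + Ry, \quad y = TF(x), \\
P\,[\,x + R\,TF(x)\,] &= -Q\,TF(x) + c, \\
(PR + Q)\,TF(x) + Px &= c .
\end{aligned}
```

Once x is known, y follows from $y = TF(x)$ and $\hat{x}$ from $\hat{x} = x + Ry$, which is the sense in which the rest is straightforward.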

Since the matrix T is nonsingular, it follows that whenever either
(PR + Q) or P is nonsingular, equation (13) can be transformed into,
respectively, one of the equations

$$F(x) + Ax = b, \tag{14}$$
$$AF(x) + x = b. \tag{15}$$


The first of these equations has been studied rather extensively (see 
Refs. 1-3 and 7) and for most of the results obtained there, it can be 
shown that parallel results are possible for equation (15). Both of these 
equations, however, are but special cases of the equation 


$$AF(x) + Bx = c, \tag{16}$$


which accommodates equation (13) directly. It is, therefore, this equation 
to which we shall now direct our attention. It will be shown that most 
of the results which have been obtained to date for equation (14) have 
rather natural (though not obvious) extensions to equation (16). It is 
important that such extensions be possible because one is often forced 
to deal with equations like (16) in the analysis of transistor networks. 
Clearly, this is the case whenever both of the matrices (PR + Q) and 
P of equation (13) are singular—and this can easily happen (for example, 
if the matrix R contains all zeros, then it will happen whenever there
exists no admittance matrix nor impedance matrix for the linear multi- 
port of Fig. 1). 


V. NOTATION 


The following notation shall be used throughout the remainder of 
the paper: For each positive integer n we denote by $E^n$ the n-dimensional
Euclidean space, the elements of which are ordered n-tuples of real
numbers, which we consider to be column vectors. The origin in $E^n$ is
denoted by $\theta$. If $x = (x_1, \ldots, x_n)^T$ and $y = (y_1, \ldots, y_n)^T$ are elements
of $E^n$ we denote their inner product by $(x, y) = \sum_{k=1}^{n} x_k y_k$. The norm of
each $x \in E^n$ is denoted by $\|x\| = (x, x)^{1/2}$.

If A is an n × n matrix, then for k = 1, ..., n, $A_k$ denotes the kth
column of A. A principal submatrix of a square matrix A is any square
submatrix of A whose main diagonal is contained in the main diagonal
of A. A principal minor of A is the determinant of any principal
submatrix of A. If D is a diagonal matrix, then D > 0 means that each
element of the main diagonal is a positive number; similarly, D ≥ 0
denotes that each element of the main diagonal is nonnegative. We
denote the n × n identity matrix by either $I_n$ or, when the dimension
is unimportant or is clear from the context, simply by I. The direct
sum of two matrices A, B is denoted by $A \oplus B$. A square matrix of real
numbers A is said to be strongly row-sum dominant if its elements
$a_{ij}$ satisfy $a_{ii} > \sum_{j \neq i} |a_{ij}|$ for i = 1, ..., n.

If f is a real valued function defined on $E^1$ then f is said to be monotone
increasing if for all x < y it follows that f(x) ≤ f(y). We say that f is
strictly monotone increasing if f(x) < f(y) for all x < y. For each positive
integer n, we denote by $\mathcal{F}^n$ that collection of mappings of $E^n$ onto itself
defined by: $F \in \mathcal{F}^n$ if and only if there exist, for k = 1, ..., n, strictly
monotone increasing functions $f_k$ mapping $E^1$ onto $E^1$ such that for
each $x = (x_1, \ldots, x_n)^T \in E^n$, $F(x) = [f_1(x_1), \ldots, f_n(x_n)]^T$.


VI. PAIRS OF MATRICES OF TYPE $\mathcal{W}_0$


Many of the recent results referred to above, concerning equation 
(14), have relied heavily upon certain properties that a matrix is known 
to possess whenever it is a member of a class of matrices that has been 
given the name $P_0$. In a similar way the results that follow rely upon
useful properties that are possessed by certain pairs of matrices. We
shall define a class, the elements of which are these pairs of matrices,
and give it the name $\mathcal{W}_0$.

The class of matrices called $P_0$ was defined by M. Fiedler and V.
Pták. They proved that the following properties of a square matrix of
real numbers, A, are equivalent:


(i) All principal minors of A are nonnegative.
(ii) For each vector $x \neq \theta$ there exists an index k such that $x_k \neq 0$
and $x_k (Ax)_k \geq 0$.
(iii) For each vector $x \neq \theta$ there exists a diagonal matrix $D_x \geq 0$
such that $(x, D_x x) > 0$ and $(Ax, D_x x) \geq 0$.
(iv) Every real eigenvalue of A, as well as of each principal submatrix
of A, is nonnegative.

Sandberg and Willson proved that another property can be added to
this list of equivalent properties, namely:

(v) $\det(D + A) \neq 0$ for every diagonal matrix D > 0.


The class of all matrices possessing one (and hence all) of the above
properties is called $P_0$.
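Property (i) gives a direct membership test for $P_0$, at the cost of examining all $2^n - 1$ principal minors. The following is a minimal sketch, assuming exact rational arithmetic is acceptable for the matrix at hand; the names `det` and `is_P0` are illustrative and not from the paper.

```python
from fractions import Fraction
from itertools import combinations

def det(matrix):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    sign = 1
    result = Fraction(1)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero entry in this column.
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return sign * result

def is_P0(matrix):
    """Property (i): every principal minor is nonnegative."""
    n = len(matrix)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            sub = [[matrix[i][j] for j in idx] for i in idx]
            if det(sub) < 0:
                return False
    return True

in_class = is_P0([[2, 1], [0, 2]])        # minors 2, 2, 4
skew_in_class = is_P0([[0, 1], [-1, 0]])  # minors 0, 0, 1
not_in_class = is_P0([[-2, 0], [0, -2]])  # a negative diagonal entry
```

The exhaustive enumeration is practical only for small n, which matches the network sizes discussed later in the paper.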

We shall now state a theorem which provides a useful generalization 
of the concept of the class of $P_0$ matrices.


Definition: For each pair of n × n matrices (A, B) we shall denote by
$\mathcal{C}(A, B)$ the collection of all the n × n matrices that can be constructed
by juxtaposing columns taken from either A or B while maintaining
the original relative ordering of the columns. Thus, $M \in \mathcal{C}(A, B)$ if
and only if for each k = 1, ..., n, either $M_k = A_k$ or $M_k = B_k$.
Obviously $\mathcal{C}(A, B)$ contains $2^n$ matrices (for certain pairs (A, B),
namely for those having $A_k = B_k$ for one or more values of k, it can
happen that two or more matrices in $\mathcal{C}(A, B)$ are identical).


Definition: The pair of n × n matrices (M, N) is said to be a
complementary pair taken from $\mathcal{C}(A, B)$ if and only if both M and N are members
of $\mathcal{C}(A, B)$ and for each k = 1, ..., n, either $M_k = A_k$ and $N_k = B_k$,
or else $M_k = B_k$ and $N_k = A_k$.


It is obvious that (A, B) is a complementary pair taken from $\mathcal{C}(A, B)$.
It is also clear that $\mathcal{C}(A, B) = \mathcal{C}(B, A)$ and, moreover, that if (M, N) is
any complementary pair taken from $\mathcal{C}(A, B)$, then $\mathcal{C}(M, N) = \mathcal{C}(A, B)$.
Furthermore, for each $M \in \mathcal{C}(A, B)$ there exists $N \in \mathcal{C}(A, B)$ such that
(M, N) is a complementary pair.


Theorem 1: The following properties of a pair of n × n matrices of real
numbers (A, B) are equivalent:


(i) $\det(AD + B) \neq 0$ for every diagonal matrix D > 0.

(ii) There exists a matrix $M \in \mathcal{C}(A, B)$ such that $\det M \neq 0$ and such
that $\det M \cdot \det N \geq 0$ for all $N \in \mathcal{C}(A, B)$.

(iii) For each vector $x \neq \theta$ there exists an index k such that either
$(A^T x)_k \neq 0$ or $(B^T x)_k \neq 0$, and such that $(A^T x)_k (B^T x)_k \geq 0$.

(iv) For each vector $x \neq \theta$ there exists a diagonal matrix $D_x \geq 0$ such
that either $(A^T x, D_x A^T x) > 0$ or $(B^T x, D_x B^T x) > 0$ (that is,
such that $(A^T x, D_x A^T x) + (B^T x, D_x B^T x) > 0$), and such that
$(A^T x, D_x B^T x) \geq 0$.

(v) For each complementary pair of matrices (M, N) taken from
$\mathcal{C}(A, B)$, each real value of $\lambda$ that satisfies $\det(M - \lambda N) = 0$ is
nonnegative.

(vi) There exists a complementary pair of matrices (M, N) taken from
$\mathcal{C}(A, B)$ such that $M^{-1}N \in P_0$.

(vii) There exists a matrix $M \in \mathcal{C}(A, B)$ such that $\det M \neq 0$; and,
for any complementary pair of matrices (M, N) taken from $\mathcal{C}(A, B)$
with $\det M \neq 0$, $M^{-1}N \in P_0$.


In this paper, we do not make use of properties (iii), (iv), or (v) of
Theorem 1. The proof that the remaining four properties are equivalent
is given in the Appendix. A complete proof of Theorem 1 is given
elsewhere.


Definition: The class of all pairs of matrices which possess one (and
hence all) of the properties listed in Theorem 1 is called $\mathcal{W}_0$.
To see that properties (i) and (ii) of Theorem 1 are in fact
generalizations of the previously mentioned properties (v) and (i), respectively,
that define $P_0$ is a simple matter. It happens that for any n × n matrix
B the pair $(I_n, B) \in \mathcal{W}_0$ if and only if $B \in P_0$. (This follows from
property (vii) of Theorem 1.) With our attention restricted to pairs of
matrices of the type $(I_n, B)$, it is clear that property (i) of Theorem 1
is equivalent to property (v) which determines those matrices B
that are in $P_0$. Concerning property (ii) of Theorem 1, an arbitrary
matrix $N \in \mathcal{C}(I_n, B)$ is either the matrix $I_n$ or else a matrix formed from
B by replacing some of the columns of B by the corresponding columns
of $I_n$. Consequently, $\det N = \det B_N$ where $B_N$ is the principal
submatrix of B that is formed by removing from B the columns that are
not present in N and then removing the corresponding rows. Hence,
since $\det I_n \neq 0$, we may take $I_n$ to be the matrix M in property (ii) of
Theorem 1, and observe that this property then becomes: $\det B_N \geq 0$
for all $N \in \mathcal{C}(I_n, B)$. It is now clear that this property is equivalent
to the property (i) that defines the class of $P_0$ matrices. (Note that there
are exactly $2^n - 1$ principal minors for each n × n matrix, and that
the set $\mathcal{C}(I_n, B) \setminus \{I_n\}$ contains exactly $2^n - 1$ members.)


VII. THEOREMS ON EXISTENCE AND UNIQUENESS 


7.1 First Existence and Uniqueness Theorem 


The following theorem, which is proved in Ref. 2, provides a necessary
and sufficient condition for the existence of a unique solution of equation
(14) for all F that are strictly monotone increasing "diagonal" mappings
of $E^n$ onto $E^n$ and for all $b \in E^n$.


Theorem 2: If A is an n × n matrix of real numbers, then there exists a
unique solution of equation (14) for each $F \in \mathcal{F}^n$ and for each $b \in E^n$ if
and only if $A \in P_0$.


Using this theorem along with the results of Section VI we can prove
the following (more general) theorem.


Theorem 3: If A and B are n × n matrices of real numbers, then there
exists a unique solution of equation (16) for each $F \in \mathcal{F}^n$ and each $c \in E^n$
if and only if $(A, B) \in \mathcal{W}_0$.


Proof: (if) Let $(A, B) \in \mathcal{W}_0$. Then, by Theorem 1, there exists a
complementary pair (M, N) taken from $\mathcal{C}(A, B)$ such that $M^{-1}N \in P_0$.


For each $F = [f_1(\cdot), \ldots, f_n(\cdot)]^T \in \mathcal{F}^n$ let $G = [g_1(\cdot), \ldots, g_n(\cdot)]^T$ denote
the mapping (also in $\mathcal{F}^n$) defined by

$$g_k(\cdot) = \begin{cases} f_k(\cdot) & \text{if } M_k = A_k, \\ f_k^{-1}(\cdot) & \text{if } M_k \neq A_k, \end{cases} \quad \text{for } k = 1, \ldots, n.$$

Clearly, the vectors x and y satisfy

$$AF(x) + Bx = MG(y) + Ny$$


if they satisfy the relation


$$y_k = \begin{cases} x_k & \text{if } M_k = A_k, \\ f_k(x_k) & \text{if } M_k \neq A_k, \end{cases} \quad \text{for } k = 1, \ldots, n, \tag{17}$$


and since this relation defines a homeomorphism of $E^n$ onto itself, it
follows that there exists a unique solution of equation (16) for each
$c \in E^n$ if there exists a unique solution of the equation


$$MG(y) + Ny = c \tag{18}$$


for each $c \in E^n$. But, that this is so follows immediately from Theorem
2 and from the fact that $M^{-1}N \in P_0$.

(only if) Suppose $(A, B) \notin \mathcal{W}_0$. Then, by Theorem 1, there exists a
diagonal matrix D > 0 such that det(AD + B) = 0. Choosing F(x) = Dx,
we have $F \in \mathcal{F}^n$, while equation (16) does not have, with this choice of
F, a unique solution for all $c \in E^n$. □
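The column-swapping change of variables used in the "if" part can be checked numerically on a small case. The following is a minimal sketch under stated assumptions: the matrices `A` and `B`, the column choice `from_A`, and the function `f(t) = t + t**3` (used for every component) are hypothetical illustrations, and the inverse is obtained by bisection rather than in closed form.

```python
def f(t):
    # Hypothetical f_k: strictly monotone increasing and onto the real line.
    return t + t ** 3

def f_inv(s):
    """Invert f by bisection (f is strictly increasing and onto)."""
    lo, hi = -1.0, 1.0
    while f(lo) > s:
        lo *= 2.0
    while f(hi) < s:
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) < s:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def mat_vec(m, v):
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]

A = [[1.0, 2.0], [0.0, 1.0]]
B = [[3.0, -1.0], [1.0, 4.0]]
n = 2
from_A = [True, False]  # column 1 of M comes from A, column 2 from B

# Complementary pair (M, N) taken from C(A, B).
M = [[(A if from_A[k] else B)[i][k] for k in range(n)] for i in range(n)]
N = [[(B if from_A[k] else A)[i][k] for k in range(n)] for i in range(n)]

x = [0.7, -1.2]
# Relation (17): y_k = x_k where M_k = A_k, otherwise y_k = f_k(x_k).
y = [x[k] if from_A[k] else f(x[k]) for k in range(n)]
# G uses f_k where M_k = A_k and the inverse of f_k otherwise.
G_y = [f(y[k]) if from_A[k] else f_inv(y[k]) for k in range(n)]

lhs = [a + b for a, b in zip(mat_vec(A, [f(t) for t in x]), mat_vec(B, x))]
rhs = [a + b for a, b in zip(mat_vec(M, G_y), mat_vec(N, y))]
max_gap = max(abs(a - b) for a, b in zip(lhs, rhs))
```

The two sides agree up to the accuracy of the numerical inverse, which is the identity AF(x) + Bx = MG(y) + Ny that reduces equation (16) to equation (18).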

There are corollaries to Theorem 2, given in Ref. 2, that also may be 
generalized in a similar manner. For example, the following result is a 
generalization of an important special case of Corollary 1 of Ref. 2; 
it shows that the condition $(A, B) \in \mathcal{W}_0$ is still sufficient to insure the
uniqueness of a solution of equation (16) (if a solution exists) even
when the mapping F is not onto.


Theorem 4: If $F(x) = [f_1(x_1), \ldots, f_n(x_n)]^T$, where each $f_k$ is a strictly
monotone increasing mapping of $E^1$ into $E^1$, and if $(A, B) \in \mathcal{W}_0$, then
there exists at most one solution of equation (16) for each $c \in E^n$.


Proof: Suppose that, for some $c \in E^n$, $x'$ and $x''$ are solutions of equation
(16) with $x' - x'' \neq \theta$. Then, $A[F(x') - F(x'')] + B(x' - x'') = \theta$. But
then, since F is a strictly monotone increasing "diagonal" mapping,
there exists a diagonal matrix D > 0 such that $F(x') - F(x'') =
D(x' - x'')$, and hence $(AD + B)(x' - x'') = \theta$. Since $x' - x'' \neq \theta$ it
follows that det(AD + B) = 0, which implies that $(A, B) \notin \mathcal{W}_0$. □
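The diagonal matrix D invoked in the proof above can be written out explicitly. This is a worked step added for clarity and is not taken from the source; the value 1 in the second case is an arbitrary positive choice, since any positive number serves when the two components coincide.

```latex
D = \operatorname{diag}(d_1, \ldots, d_n), \qquad
d_k =
\begin{cases}
\dfrac{f_k(x'_k) - f_k(x''_k)}{x'_k - x''_k}, & x'_k \neq x''_k, \\[2ex]
1, & x'_k = x''_k .
\end{cases}
```

Each difference quotient is positive because $f_k$ is strictly monotone increasing, so D > 0 and $F(x') - F(x'') = D(x' - x'')$ holds component by component.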


7.2 A Nonuniqueness Theorem 


From the proof of the "only if" part of Theorem 2 (given in Ref. 2)
it follows that whenever $A \notin P_0$, there exists a mapping $F \in \mathcal{F}^n$ and a
vector $b \in E^n$ such that equation (14) has more than one solution. On
the other hand, even if $A \notin P_0$, if the mapping $F \in \mathcal{F}^n$ is "fixed," then
it is easy to see that the nonuniqueness of solutions of equation (14)
need not necessarily follow for any $b \in E^n$ [take F(x) = x and Ax = −2x,
for example]. I. W. Sandberg has shown, however, that if one assumes
that the "fixed" mapping F has another special property, rather than
assuming that $F \in \mathcal{F}^n$, then the nonuniqueness of solutions of equation
(14) follows, for some $b \in E^n$, whenever $A \notin P_0$. Moreover, he has
shown that under these hypotheses and for any $\delta > 0$, there exists some
$b \in E^n$ such that equation (14) has two solutions, x and y, which satisfy
$\|x - y\| = \delta$. The special property that F is assumed to have is given in
the following definition (in words, the property is: that it be possible
to draw a straight line having any given positive slope, and any given
length, between some pair of points on the graph of each of the functions
$f_k$).


Definition: For each positive integer n we denote by $\mathcal{G}^n$ that collection
of mappings of $E^n$ into itself defined by: $F \in \mathcal{G}^n$ if and only if there exist,
for k = 1, ..., n, continuous functions $f_k$ mapping $E^1$ into $E^1$ such that
for each $x \in E^n$, $F(x) = [f_1(x_1), \ldots, f_n(x_n)]^T$, with each of the $f_k$ satisfying,
for all $\beta > 0$,


$$\inf\,\{f_k(\alpha + \beta) - f_k(\alpha) : -\infty < \alpha < \infty\} = 0,$$
$$\sup\,\{f_k(\alpha + \beta) - f_k(\alpha) : -\infty < \alpha < \infty\} = \infty.$$


By using Theorem 1 it is possible to prove the following generalization
of Sandberg's result:


Theorem 5: Let $F \in \mathcal{G}^n$, let $(A, B) \notin \mathcal{W}_0$ be a pair of real n × n matrices,
and let $\delta$ be a positive constant. Then, for some $c \in E^n$ there exist solutions
of equation (16), x and y, satisfying $\|x - y\| = \delta$.


Proof: Since $(A, B) \notin \mathcal{W}_0$ there exists a diagonal matrix
$D = \mathrm{diag}(d_1, \ldots, d_n) > 0$ such that det(AD + B) = 0. Therefore, there
exists $x^* \in E^n$, with $\|x^*\| = \delta$, such that $(AD + B)x^* = \theta$. Since $F \in \mathcal{G}^n$
there exists $x \in E^n$ such that


$$f_k(x_k) - f_k(x_k - x_k^*) = x_k^* d_k, \quad \text{for } k = 1, \ldots, n.$$

Let c = AF(x) + Bx, and let y = x − x*. Then

$$A[F(x) - F(y)] + B(x - y) = A[F(x) - F(x - x^*)] + Bx^* = (AD + B)x^* = \theta. \qquad \square$$


For a mapping F to be a member of $\mathcal{G}^n$, it is not necessary that $F \in \mathcal{F}^n$.
It follows from the above definition of $\mathcal{G}^n$ that $F \in \mathcal{G}^n$ implies that each
of the functions $f_k$ is a monotone increasing function from $E^1$ onto some
interval in $E^1$ whose length is infinite; the $f_k$ need not, however, be
strictly monotone increasing, nor onto $E^1$. For those $F \in \mathcal{G}^n$ for which
each of the functions $f_k$ is strictly monotone increasing, we have the
following corollary to the two preceding theorems.


Corollary: Let $F(x) = [f_1(x_1), \ldots, f_n(x_n)]^T \in \mathcal{G}^n$ and let each of the functions
$f_k$ be strictly monotone increasing. Then there exists at most one solution
of equation (16) for each $c \in E^n$ if and only if $(A, B) \in \mathcal{W}_0$.


VIII. RESULTS ON CONTINUITY AND BOUNDEDNESS 


For many systems whose behavior is described by an equation having 
the form (16), the vector c may be regarded as the system’s input and 
the vector « may be regarded as the system’s response or output. Those 
properties that one might expect well-behaved systems to possess are 
likely to include continuity and boundedness. Thus, one might expect (i)
"small" changes to result in the value of the system's output when
"small" changes are made in the value of the system's input, and (ii) a
bounded sequence of input vectors to yield a bounded sequence of 
outputs. We now show that such properties are indeed possessed by 
the type of system that is the main concern of this paper. 


8.1 Continuity 


When the n × n matrix A is a member of the class $P_0$ and the mapping
$F \in \mathcal{F}^n$, it follows that the solution x of equation (14) is a continuous
function of the (input) vector b. Using this fact, it is easy to prove the
following theorem.


Theorem 6: For each $F \in \mathcal{F}^n$ and each pair of n × n matrices $(A, B) \in \mathcal{W}_0$
the solution x of equation (16) is a continuous function of the vector c.


Proof: Proceeding as in the "if" part of the proof of Theorem 3, we see
that the theorem follows immediately from the facts that equation
(17) is a homeomorphism and that the aforementioned result guarantees
that y, the solution of equation (18), is a continuous function of c. □


8.2 Boundedness 


In Ref. 2 a theorem (Theorem 5) is proved which shows that, when
$F \in \mathcal{F}^n$ and $A \in P_0$, bounds can be obtained for the solution of equation
(14) whenever bounds for $b \in E^n$ are given. The proof of a more general
theorem concerning equation (16) can be constructed quite easily by
using that theorem, and by using the same technique that was used in
the proof of the preceding theorem, along with the trivial observations:

(i) For any nonsingular n × n matrix of real numbers, M, and any
real numbers $\alpha_i \leq \beta_i$, i = 1, ..., n, there exist real numbers
$\alpha_i' \leq \beta_i'$, i = 1, ..., n, such that when each of the components
$c_i$ of the vector c satisfies $\alpha_i \leq c_i \leq \beta_i$, it follows that
$\alpha_i' \leq (M^{-1}c)_i \leq \beta_i'$, for i = 1, ..., n.

(ii) For any given real numbers $\gamma_i \leq \delta_i$, i = 1, ..., n, there exist,
for the homeomorphism (17), real numbers $\gamma_i' \leq \delta_i'$, i = 1, ..., n,
such that whenever x, y satisfy equation (17) with $\gamma_i \leq y_i \leq \delta_i$,
for i = 1, ..., n, it follows that $\gamma_i' \leq x_i \leq \delta_i'$, for i = 1, ..., n.


The more general theorem, whose quite obvious proof is omitted, 
is the following: 


Theorem 7: Let $F \in \mathcal{F}^n$, let $(A, B) \in \mathcal{W}_0$ be a pair of n × n matrices, and,
for i = 1, ..., n, let $\alpha_i \leq \beta_i$ be given. There exist, for i = 1, ..., n, real
numbers $\gamma_i \leq \delta_i$ such that for any $c = (c_1, \ldots, c_n)^T \in E^n$ with $\alpha_i \leq c_i \leq \beta_i$
for i = 1, ..., n, if x satisfies equation (16), then $\gamma_i \leq x_i \leq \delta_i$ for
i = 1, ..., n.


According to Theorem 7, $(A, B) \in \mathcal{W}_0$ is a sufficient condition for a
bounded sequence of vectors c to yield a bounded sequence of solution
vectors of equation (16), for all $F \in \mathcal{F}^n$. The following theorem shows
that $(A, B) \in \mathcal{W}_0$ is also a necessary condition.


Theorem 8: If (A, B) is a pair of real n × n matrices, then $(A, B) \in \mathcal{W}_0$ if
and only if for each $F \in \mathcal{F}^n$ and each unbounded sequence of points
$x^1, x^2, x^3, \ldots$ in $E^n$, the corresponding sequence $c^1, c^2, c^3, \ldots$
[$c^k = AF(x^k) + Bx^k$, k = 1, 2, 3, ...] is unbounded.


This theorem, which is a generalization of Theorem 4 of Ref. 2, can be
proved in a manner which is a quite obvious generalization of the proof,
given there, of that theorem. Thus, an appeal to Theorem 7 proves the
"only if" part, and the "if" part is proved by assuming that $(A, B) \notin \mathcal{W}_0$
and then choosing the same kind of mapping $F \in \mathcal{F}^n$ as was chosen in
Ref. 2, for which an unbounded sequence of vectors $x^k$ yields a bounded
sequence of vectors $c^k$.


IX. COMPUTATION OF THE SOLUTION 


A. Gersho has shown that whenever $F \in \mathcal{F}^n \cap C^1$ (that is, whenever
each of the functions $f_k$ is a continuously differentiable strictly monotone
increasing mapping of the real line onto itself), it is possible to compute
the solution of equation (14), for any $A \in P_0$ and any $b \in E^n$, by making
use of a gradient descent algorithm due to A. A. Goldstein. The following
theorem extends this result to the class of equations of the type (16).
Theorem 9: Let M be an arbitrary positive definite symmetric matrix,
and let $Q : E^n \to E^1$ be defined by

$$Q(x) = [AF(x) + Bx - c]^T M [AF(x) + Bx - c],$$


where $F \in \mathcal{F}^n \cap C^1$, $(A, B) \in \mathcal{W}_0$, and $c \in E^n$. For each $x \in E^n$ and each
$\gamma \geq 0$ let


$$g(x, \gamma) = \begin{cases} \dfrac{Q(x) - Q(x - \gamma \nabla Q(x))}{\gamma\, \|\nabla Q(x)\|^2}, & \gamma > 0; \\[2ex] 1, & \gamma = 0; \end{cases}$$


where $\nabla Q(x)$ denotes the gradient of Q at the point x. Then, if $\delta$ is any real
number satisfying $0 < \delta \leq 1/2$, and if $x^0$ is an arbitrary point in $E^n$, the
sequence $\{x^k : k = 0, 1, 2, \ldots\}$ converges to the solution of equation (16),
where (for k = 0, 1, 2, ...) the $x^k$ satisfy


$$x^{k+1} = x^k - \gamma^k \nabla Q(x^k),$$


each $\gamma^k$ being any real number that satisfies $\delta \leq g(x^k, \gamma^k) \leq 1 - \delta$ if
$g(x^k, 1) < \delta$, or $\gamma^k = 1$ if $g(x^k, 1) \geq \delta$.


Proof: This proof uses generalizations of some of the ideas in Ref. 7 and
relies ultimately upon the Goldstein algorithm.

We first remark that the sequence $\{x^k\}$ is well-defined: It is easy to
show (see the first part of the proof of Theorem 1, p. 31, Ref. 11) that
for each $x \in E^n$, $g(x, \cdot)$ is a continuous function on $[0, \infty)$. This being
the case, it is clear that if $g(x^k, 1) < \delta$, then for each $\xi$ in the interval
$[\delta, 1)$ (and, in particular, for each $\xi$ in the interval $[\delta, 1 - \delta]$) there is
some $\gamma^k$ in the interval (0, 1) such that $g(x^k, \gamma^k) = \xi$.

Let $S = \{x \in E^n : Q(x) \leq Q(x^0)\}$. Using the fact that M is a positive
definite symmetric matrix, and using the fact that $F \in \mathcal{F}^n$, $(A, B) \in \mathcal{W}_0$
implies that $\|AF(x) + Bx\| \to \infty$ if and only if $\|x\| \to \infty$ (Theorem 8)
we have that the set $S \subset E^n$ is bounded. By continuity of Q, S is closed.
Thus, S is compact and, therefore, the gradient $\nabla Q$ (which is continuous
on $E^n$, since $F \in C^1$) is uniformly continuous on S, and $\nabla Q$ is bounded
on S. Also, Q is bounded below on S. [Indeed, we have $Q \geq 0$ on $E^n$ and
by the existence and uniqueness theorem, Theorem 3, there exists
exactly one point $x^*$ ($x^* \in S$) at which $Q(x^*) = 0$.]

It is easily verified that, for each $x \in E^n$,


$$\nabla Q(x) = 2(AD_x + B)^T M [AF(x) + Bx - c],$$


where, for k = 1, ..., n, the kth diagonal element of the diagonal matrix
$D_x > 0$ has the value of the derivative of the function $f_k$, evaluated at
the point $x_k$. Since $(A, B) \in \mathcal{W}_0$ implies that $\det(AD_x + B) \neq 0$, and
since $\det M \neq 0$, it follows that $\nabla Q(x) = \theta$ if and only if x is the solution
of equation (16).

In view of the above, it follows directly from Goldstein's theorem that
the sequence $\{x^k\}$ converges to the solution of equation (16). □
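The iteration of Theorem 9 can be sketched for a small case. The following is an illustration under stated assumptions, not the author's program: M is taken to be the identity, the pair `A`, `B` and the function `f(t) = t + t**3` are hypothetical choices (with A = I and B in $P_0$, so the pair is in $\mathcal{W}_0$), $\delta$ = 0.25, and the step $\gamma^k$ is located by bisection on [0, 1], which is one admissible way of satisfying the stated bounds on g.

```python
def f(t):
    return t + t ** 3            # hypothetical f_k: C^1, strictly increasing, onto

def df(t):
    return 1.0 + 3.0 * t * t     # derivative of f, always positive

A = [[1.0, 0.0], [0.0, 1.0]]
B = [[2.0, 1.0], [0.0, 2.0]]

def residual(x, c):
    n = len(x)
    return [sum(A[i][k] * f(x[k]) + B[i][k] * x[k] for k in range(n)) - c[i]
            for i in range(n)]

def Q(x, c):
    return sum(r * r for r in residual(x, c))      # M = identity

def grad_Q(x, c):
    # 2 (A D_x + B)^T [A F(x) + B x - c], with D_x = diag(f'(x_k))
    r = residual(x, c)
    n = len(x)
    return [2.0 * sum((A[i][k] * df(x[k]) + B[i][k]) * r[i] for i in range(n))
            for k in range(n)]

def g(x, gamma, grad, c):
    if gamma == 0.0:
        return 1.0
    trial = [xk - gamma * gk for xk, gk in zip(x, grad)]
    return (Q(x, c) - Q(trial, c)) / (gamma * sum(gk * gk for gk in grad))

def choose_gamma(x, grad, c, delta):
    if g(x, 1.0, grad, c) >= delta:
        return 1.0
    lo, hi, gamma = 0.0, 1.0, 0.5
    for _ in range(100):
        gamma = 0.5 * (lo + hi)
        value = g(x, gamma, grad, c)
        if value > 1.0 - delta:
            lo = gamma               # step too short
        elif value < delta:
            hi = gamma               # step too long
        else:
            break                    # delta <= g <= 1 - delta
    return gamma

def solve(c, x0, delta=0.25, max_iter=5000, tol=1e-20):
    x = list(x0)
    for _ in range(max_iter):
        if Q(x, c) < tol:
            break
        grad = grad_Q(x, c)
        gamma = choose_gamma(x, grad, c, delta)
        x = [xk - gamma * gk for xk, gk in zip(x, grad)]
    return x

x_true = [1.0, -0.5]
c = [sum(A[i][k] * f(x_true[k]) + B[i][k] * x_true[k] for k in range(2))
     for i in range(2)]
x = solve(c, [0.0, 0.0])
```

The right-hand side c is built from a known point so that the computed x can be compared with it; the stopping tolerance and iteration cap are practical additions that the theorem itself does not specify.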

Other methods of computing the solution of equation (16), in certain 
cases, also exist. If one performs a transformation of the type (17) on 
the independent variable x (in theory this can always be done) then the
solution of equation (16) can easily be computed by first computing the
solution of an equation of the type $G(y) + M^{-1}Ny = M^{-1}c$, where
$G \in \mathcal{F}^n$ and $M^{-1}N \in P_0$. Methods of computing the solution of certain
equations of this type may be found in Refs. 1-3. 


X. EXAMPLE 


With the aid of the modern computing facilities that are commonly 
available today, it is clearly a rather routine matter to obtain an equation 
of the type (16) for any given transistor network. Moreover, it is not 
unfeasible, even for networks of moderately large size (say, up to 4 or 
5 transistors), to consider the straightforward evaluation of the $2^n$
determinants specified in property (ii) of Theorem 1, and thereby resolve
the issue of whether or not the matrices involved in the equation are
a $\mathcal{W}_0$ pair. Due regard would of course have to be paid to the matter of
performing sufficiently accurate computations. 
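The straightforward determinant evaluation mentioned above can be sketched as follows. This is a minimal illustration, assuming exact rational arithmetic is used to sidestep the accuracy concern; the names `det`, `column_mixes`, and `is_W0` are hypothetical. The test applies property (ii) of Theorem 1 in the equivalent form "at least one determinant is nonzero and all nonzero determinants have the same sign."

```python
from fractions import Fraction
from itertools import product

def det(matrix):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    sign = 1
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return sign * result

def column_mixes(A, B):
    """Yield every matrix whose kth column is the kth column of A or of B."""
    n = len(A)
    for choice in product((0, 1), repeat=n):
        yield [[(A, B)[choice[k]][i][k] for k in range(n)] for i in range(n)]

def is_W0(A, B):
    """Property (ii): some det M != 0 and det M * det N >= 0 for all N."""
    nonzero = [d for d in (det(m) for m in column_mixes(A, B)) if d != 0]
    if not nonzero:
        return False
    return all(d > 0 for d in nonzero) or all(d < 0 for d in nonzero)

identity = [[1, 0], [0, 1]]
pair_ok = is_W0(identity, [[2, 1], [0, 2]])      # second matrix is in P_0
pair_bad = is_W0(identity, [[-2, 0], [0, -2]])   # det(AD + B) = 0 for D = 2I
```

The loop visits $2^n$ matrices for n × n inputs, so the cost doubles with each added column; this is the sense in which the check is feasible only for networks of moderate size.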

On the other hand, even without the aid of a computer, it should often 
be possible to use a little ingenuity and a few devices* to reduce the 
computations involved in the application of the above theory to many 
specific problems to a point where they will just about fit onto the back 
of an envelope. Consider, for example, the following analysis of a three- 
transistor network: 

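For the network of Fig. 5, the voltage and current variables defined
there must satisfy the following equations:

$$
\begin{bmatrix} i_1 \\ i_2 \\ i_3 \\ i_4 \\ i_5 \\ i_6 \end{bmatrix}
=
\begin{bmatrix} T^{(1)} & & \\ & T^{(2)} & \\ & & T^{(3)} \end{bmatrix}
\begin{bmatrix} f_1(v_1) \\ f_2(v_2) \\ f_3(v_3) \\ f_4(v_4) \\ f_5(v_5) \\ f_6(v_6) \end{bmatrix}
\tag{19}
$$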

* According to R. Bellman: “a device is a trick that works at least twice.”










0; 0 oO 0 ! 1 0 0 |{—%, E 

V3 0 R, O ! : 0 || —7 E 

Ne ee Ol. 100) 
tg —-1 -1 0 ! 0 O 0 |) v. 0 
ee 0-1 010 G Gif af fo 

—is 0 0 -1:0 G& GQAlL 4, 0 


where (we are using the transistor model of Fig. 3, with $r_e = r_b = r_c = 0$)
each of the $2 \times 2$ matrices $T^{(k)}$, $k = 1, 2, 3$, is of the form (10). A hybrid
characterization has been used for the linear part of the network. As
indicated in equation (3), this hybrid characterization can easily be
converted into a characterization of the Belevitch type. Thus, denoting
the $3 \times 3$ blocks of the hybrid matrix in equation (20) by $H_{11}$, $H_{12}$, $H_{21}$,
$H_{22}$, in the usual manner, one obtains


$$
\begin{bmatrix} I & -H_{12} \\ 0 & -H_{22} \end{bmatrix} v
+ \begin{bmatrix} H_{11} & 0 \\ H_{21} & -I \end{bmatrix} i = c, \tag{21}
$$

where $v = (v_1, v_3, v_6, v_2, v_4, v_5)$, $i$ is similarly defined, and $c$ denotes
the constant vector of source terms in equation (20). We could now
simply reorder the columns of each matrix in equation (21) in such a way
that the resulting equation would have the same form, except that the
subscripts on the components of the vectors $v$ and $i$ would occur in the
natural order (1, 2, 3, 4, 5, 6) and then use that equation, along with
equation (19), to produce an equation of the type (16) for our network.
In this example, though, it is probably easier to reorder the rows and
columns of the matrix $T$ (recall, $T = T^{(1)} \oplus T^{(2)} \oplus T^{(3)}$) to obtain
from equation (19) an equation that is compatible with equation (21).


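Thus,

$$
i = \begin{bmatrix} I & -P \\ -Q & I \end{bmatrix} F(v), \tag{22}
$$

where
$$F(v) = [f_1(v_1), f_3(v_3), f_6(v_6), f_2(v_2), f_4(v_4), f_5(v_5)]^{\mathrm{tr}},$$
and
$$P = \mathrm{diag}\,[\alpha_r^{(1)}, \alpha_r^{(2)}, \alpha_f^{(3)}], \qquad Q = \mathrm{diag}\,[\alpha_f^{(1)}, \alpha_f^{(2)}, \alpha_r^{(3)}].$$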


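Eliminating $i$ from equations (21) and (22), we obtain

$$
\begin{bmatrix} H_{11} & 0 \\ H_{21} & -I \end{bmatrix}
\begin{bmatrix} I & -P \\ -Q & I \end{bmatrix} F(v)
+ \begin{bmatrix} I & -H_{12} \\ 0 & -H_{22} \end{bmatrix} v = c, \tag{23}
$$

where $c$ is the constant vector on the right side of equation (21).
Note that since $\det H_{11} = \det H_{22} = 0$, it is impossible to put this equation
into either of the forms (14) or (15). Clearly this would be the same situation
no matter which ordering of subscripts was chosen for the components
of $v$. The cause of the difficulty is simply the fact that neither an impedance
matrix nor an admittance matrix exists for the linear part of our network.

Let us determine whether or not the pair of matrices

$$
\begin{bmatrix} H_{11} & 0 \\ H_{21} & -I \end{bmatrix}
\begin{bmatrix} I & -P \\ -Q & I \end{bmatrix},
\qquad
\begin{bmatrix} I & -H_{12} \\ 0 & -H_{22} \end{bmatrix}
$$

is a $W_0$ pair. We shall try to verify property (i) of Theorem 1. Let $\delta_1, \ldots, \delta_6$
denote arbitrary positive real numbers, and let $\Delta_I = \mathrm{diag}\,(\delta_1, \delta_2, \delta_3)$,
$\Delta_{II} = \mathrm{diag}\,(\delta_4, \delta_5, \delta_6)$. We wish to show that

$$
\det \left\{
\begin{bmatrix} H_{11} & 0 \\ H_{21} & -I \end{bmatrix}
\begin{bmatrix} I & -P \\ -Q & I \end{bmatrix}
\begin{bmatrix} \Delta_I & 0 \\ 0 & \Delta_{II} \end{bmatrix}
+ \begin{bmatrix} I & -H_{12} \\ 0 & -H_{22} \end{bmatrix}
\right\} \neq 0.
$$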
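By multiplying the above matrix on the left by the (nonsingular) matrix
$\mathrm{diag}\,(I_3, -I_3)$ and then multiplying on the right by $\mathrm{diag}\,(\Delta_I^{-1}, I_3)$, and by
writing $\Delta_I$ for the (equally arbitrary) positive diagonal matrix $\Delta_I^{-1}$, we
obtain the equivalent statement:

$$
\det \begin{bmatrix}
H_{11} + \Delta_I & -H_{12} - H_{11}P\Delta_{II} \\
-H_{21} - Q & H_{22} + (I + H_{21}P)\Delta_{II}
\end{bmatrix} \neq 0.
$$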


The $3 \times 3$ submatrix in the upper left corner is nonsingular and diagonal.
The $3 \times 3$ submatrix in the lower left corner can be diagonalized by
performing a single elementary row operation on the matrix; namely, by
subtracting $1/(\delta_2 + R_1)$ times the second row from the fourth row.
Having done this, our problem reduces to one of showing that
Fg 0 0 A 0 0 |

0 & +R, 0 -1 —(1 + aR, 85) 0 
det ee oe Tce Oe ee 0 TO a Ra 80) #0. 
! 
_ ow bey ay 1 1 — a,” 8 5 

lat 0 0 a an”) bs + Se CR 0 
0 1 — as” 0 G, + (1 — a!) 5; G, 

L 0 0 1- a | 0 G, G, + (1 — a7”) 86] 


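It is easy to verify that whenever $\det A_{11} \neq 0$, then

$$
\det \begin{bmatrix} A_{11} & A_{12} \\ A_{21} & A_{22} \end{bmatrix} \neq 0
$$

if and only if $\det (A_{22} - A_{21}A_{11}^{-1}A_{12}) \neq 0$. In our case both $A_{11}$ and $A_{21}$
are diagonal and hence we can immediately reduce our problem to: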











an 1 1-a‘? 1—a! 8,6, 
te Merge +R, 7 
det 1—a;” (2) l-a;? (2) 
3 R, G,+(1—a,;”’)6;+ 34h, te R, 6s) G, 0. 
(3) 1—a;” (3) 
0 G, G.+(L—-a7”)be+ 54+Ree te R55) 


It is obvious that the above determinant is always positive. First, note
that every term in the matrix is nonnegative except, possibly, the
(1, 2) term, which may be either positive or negative (or zero). In the
event that the (1, 2) term is positive (or zero), we have
$1/(\delta_2 + R_1) \ge (1 - \alpha_r^{(2)}\delta_2\delta_5)/(\delta_2 + R_1)$, and hence we observe that the matrix is
strongly row-sum dominant. This implies that its determinant is positive.

In the event that the (1, 2) term is negative, we do not necessarily 
have dominance; however, considering an expansion of the determinant 
along its first row we see that, because of the assumption that the (1, 2) 
term is negative, the value of the determinant is computed as the sum of 
two positive terms. 
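The dominance step in the argument above can be illustrated numerically. The following is a minimal sketch, assuming that "strongly row-sum dominant" means that each diagonal entry is positive and exceeds the sum of the magnitudes of the other entries in its row (the paper's own definition is not reproduced in this section). The matrix `example` is invented for illustration and is not the matrix of the three-transistor network; `det` and `is_strongly_row_sum_dominant` are hypothetical helper names.

```python
from fractions import Fraction


def det(matrix):
    """Exact determinant by cofactor expansion along the first row (fine for small n)."""
    if len(matrix) == 1:
        return Fraction(matrix[0][0])
    total = Fraction(0)
    for j, entry in enumerate(matrix[0]):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * Fraction(entry) * det(minor)
    return total


def is_strongly_row_sum_dominant(matrix):
    """Assumed definition: each diagonal entry exceeds the sum of |off-diagonal| entries in its row."""
    return all(
        row[i] > sum(abs(x) for j, x in enumerate(row) if j != i)
        for i, row in enumerate(matrix)
    )


# Invented 3 x 3 example with nonnegative entries, as in the first case of the text.
example = [[Fraction(5, 2), Fraction(1, 2), 1],
           [1, 3, 1],
           [0, 1, 2]]

print(is_strongly_row_sum_dominant(example), det(example) > 0)
```

For a real matrix of this kind every Gershgorin disc lies in the open right-half plane, which is one way to see why the determinant (the product of the eigenvalues) is positive.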

We have thus shown that, no matter which (positive) values are 
assigned to R, , R, , R; , or which values the transistor’s current gains 
assume [$0 < \alpha_f^{(k)} < 1$, $0 < \alpha_r^{(k)} < 1$], the pair of $6 \times 6$ matrices that appear
in equation (23) is a $W_0$ pair. Thus, all of the results concerning a solution's
existence, uniqueness, continuity, boundedness, and so on, hold
for this equation.
APPENDIX


Proof of Part of Theorem 1 


In this appendix we prove the equivalence of properties (i), (ii), (vi),
and (vii) of Theorem 1, which define the class of pairs of matrices $W_0$.
We omit the proof of the equivalence of the three remaining properties, 
since those properties are not referred to in this paper. A complete proof 
of Theorem 1 is given elsewhere.” We begin by proving a useful lemma: 


Lemma 1: For each positive integer $n$ the polynomial

$$
c_0\,d_1 d_2 \cdots d_n + c_1\,d_1 d_2 \cdots d_{n-1} + \cdots + c_n\,d_2 \cdots d_n
+ c_{n+1}\,d_1 d_2 \cdots d_{n-2} + \cdots + c_{n(n+1)/2}\,d_3 \cdots d_n + \cdots + c_{2^n-1}
$$

in the $n$ variables $d_1, d_2, \ldots, d_n$ is nonzero for all positive values of the
variables if and only if at least one of the coefficients $c_0, \ldots, c_{2^n-1}$ is nonzero,
and all nonzero coefficients have the same sign.


Proof: (By induction) For $n = 1$ the statement is obviously true.
Let $N$ be a positive integer. Then any polynomial of the above type in
$N + 1$ variables, $c_0\,d_1 \cdots d_{N+1} + \cdots + c_{2^{N+1}-1}$, can be written as
$P(d_1, \ldots, d_N) \cdot d_{N+1} + Q(d_1, \ldots, d_N)$ where $P$ and $Q$ are both polynomials
of the above type in $N$ variables. Then, assuming that the statement
is true for $n = N$, $P + Q \neq 0$ and $P \cdot Q \ge 0$ for all positive values of
the variables $d_1, \ldots, d_N$ if and only if at least one of the coefficients
$c_0, \ldots, c_{2^{N+1}-1}$ is nonzero and all nonzero coefficients have the same sign.
But, we know that $P \cdot d_{N+1} + Q \neq 0$ for all $d_{N+1} > 0$ if and only if
$P + Q \neq 0$ and $P \cdot Q \ge 0$. □


A.1 Property (i) is Equivalent to (ii)

Let $D = \mathrm{diag}\,(d_1, \ldots, d_n)$. By expanding $\det (AD + B)$ along the
first column we have

$$\det (AD + B) = d_1 \cdot \det P + \det Q,$$

where the first columns of $P$ and $Q$ satisfy $P_1 = A_1$, $Q_1 = B_1$, and for
$k = 2, \ldots, n$, $P_k = Q_k = (AD + B)_k$. Both $P$ and $Q$ are independent
of $d_1$. We now expand $\det P$ and $\det Q$ along their second columns,
resulting in

$$\det P = d_2 \cdot \det R + \det S,$$
$$\det Q = d_2 \cdot \det U + \det V,$$

and hence,

$$\det (AD + B) = d_1 d_2 \cdot \det R + d_1 \cdot \det S + d_2 \cdot \det U + \det V,$$

where

$$R_1 = A_1, \quad R_2 = A_2,$$
$$S_1 = A_1, \quad S_2 = B_2,$$
$$U_1 = B_1, \quad U_2 = A_2,$$
$$V_1 = B_1, \quad V_2 = B_2,$$

and for $k = 3, \ldots, n$,

$$R_k = S_k = U_k = V_k = (AD + B)_k.$$

Proceeding in this manner until all columns of $(AD + B)$ have been
encountered, we obtain an expansion of $\det (AD + B)$ as a polynomial
in the variables $\{d_1, d_2, \ldots, d_n\}$ whose coefficients are the determinants
of the matrices in $\mathcal{C}(A, B)$. By using Lemma 1 it thus follows that (i)
and (ii) are equivalent.
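The column-by-column expansion above can be checked numerically. The sketch below, with hypothetical function names `det_direct` and `det_expanded` and arbitrarily chosen test matrices, compares $\det (AD + B)$ with the sum over all $2^n$ column choices of (product of the $d_k$ for the columns taken from $A$) times (determinant of the mixed matrix). It assumes, as the derivation suggests, that the matrices of $\mathcal{C}(A, B)$ are exactly these column mixtures.

```python
from fractions import Fraction
from itertools import product


def det(matrix):
    """Exact determinant by cofactor expansion along the first row (fine for small n)."""
    if len(matrix) == 1:
        return Fraction(matrix[0][0])
    total = Fraction(0)
    for j, entry in enumerate(matrix[0]):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * Fraction(entry) * det(minor)
    return total


def det_direct(A, B, d):
    """det(A D + B) with D = diag(d): column k of A is scaled by d[k]."""
    n = len(A)
    return det([[A[r][k] * d[k] + B[r][k] for k in range(n)] for r in range(n)])


def det_expanded(A, B, d):
    """Sum over column choices: (product of d[k] where column k comes from A) * det(mixture)."""
    n = len(A)
    total = Fraction(0)
    for take_a in product((True, False), repeat=n):
        mixture = [[(A if take_a[k] else B)[r][k] for k in range(n)] for r in range(n)]
        weight = Fraction(1)
        for k in range(n):
            if take_a[k]:
                weight *= d[k]
        total += weight * det(mixture)
    return total


A = [[2, 1, 0], [0, 1, 3], [1, 0, 1]]
B = [[1, 0, 2], [1, 1, 0], [0, 2, 1]]
d = [Fraction(1, 2), 3, 5]
print(det_direct(A, B, d) == det_expanded(A, B, d))
```

The two values agree exactly because the determinant is linear in each column separately, which is the fact the expansion relies on.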


A.2 Property (vi) Follows from (i) and (ii)


According to (ii) there exists a complementary pair of matrices $(M, N)$
taken from $\mathcal{C}(A, B)$ such that $\det M \neq 0$. Let $D = \mathrm{diag}\,(d_1, \ldots, d_n) > 0$,
then $\det (M^{-1}N + D) \neq 0$ if and only if $\det (MD + N) \neq 0$. But,
using property (i), $\det (MD + N) = \det (A\tilde{D} + B) \cdot \det \hat{D} \neq 0$, where
the matrices $\tilde{D} = \mathrm{diag}\,(\tilde{d}_1, \ldots, \tilde{d}_n) > 0$ and $\hat{D} = \mathrm{diag}\,(\hat{d}_1, \ldots, \hat{d}_n) > 0$
are defined by $\tilde{d}_k = d_k$ and $\hat{d}_k = 1$ if $M_k = A_k$, and $\tilde{d}_k = 1/d_k$, $\hat{d}_k = d_k$
otherwise (for $k = 1, \ldots, n$). Thus, $M^{-1}N \in P_0$.


A.3 Property (i) Follows from (vi)


Using the notation above, it is clear that for each diagonal matrix
$D > 0$, $\det (AD + B) = \det (M\tilde{D} + N) \cdot \det \hat{D}$. Thus, if $M^{-1}N \in P_0$ it
follows that $\det (AD + B) \neq 0$.


A.4 Property (vii) is Equivalent to (vi)


Clearly property (vi) follows from property (vii). Thus, we need
only prove that (vi) implies (vii). Let $(M, N)$ and $(P, Q)$ both be complementary
pairs taken from $\mathcal{C}(A, B)$ with $M^{-1}N \in P_0$ and $\det P \neq 0$.
For any $D = \mathrm{diag}\,(d_1, \ldots, d_n) > 0$, $\det (P^{-1}Q + D) \neq 0$ if and only if
$\det (PD + Q) \neq 0$. But $\det (PD + Q) = \det (M\tilde{D} + N) \cdot \det \hat{D} \neq 0$, where
the matrices $\tilde{D} = \mathrm{diag}\,(\tilde{d}_1, \ldots, \tilde{d}_n) > 0$ and $\hat{D} = \mathrm{diag}\,(\hat{d}_1, \ldots, \hat{d}_n) > 0$
are defined by $\tilde{d}_k = d_k$ and $\hat{d}_k = 1$ if $P_k = M_k$, and $\tilde{d}_k = 1/d_k$, $\hat{d}_k = d_k$
otherwise (for $k = 1, \ldots, n$). Thus, $P^{-1}Q \in P_0$. □
Theorems on the Computation of the
Transient Response of Nonlinear
Networks Containing Transistors
and Diodes

By I. W. SANDBERG
(Manuscript received June 15, 1970)


We consider in detail the nonlinear equations encountered at each time
step when certain implicit numerical-integration algorithms are used.
In terms of only the properties of the Jacobian matrix of the pertinent set of
differential equations, we present necessary and sufficient conditions for
the existence and uniqueness of the solution of the nonlinear equations for
all continuous forcing functions and any given step size. Since engineers
often think about dynamic nonlinear transistor network problems in terms
of the eigenvalues of the relevant Jacobian matrix, the results described are
of immediate conceptual value. In particular, it is possible to carry out
the algorithms whenever the conditions presented are satisfied.

Several other types of results are also presented. For example, for a
special but significant and useful numerical-integration formula, theorems
are proved concerning properties of the computed sequence such as the
extent to which the sequence is relatively immune to small local errors
introduced at each step as a result of the fact that it is ordinarily not possible
to compute the solution of a certain equation exactly.

All of the results are concerned with network models that are often used 
in computer simulations. In fact, we heavily exploit some special properties 
possessed by the nonlinear functions associated with such models. 


I. INTRODUCTION 


The set $P_0$ of all real square matrices each with all principal minors
nonnegative plays a key role in the study’ ° of nonlinear equations of 
the form F(x) + Ax = B, and more generally* of equations of the form 
$CF(x) + Ax = B$, in which $F(\cdot)$ is a “diagonal monotone-nondecreasing
mapping” of real Euclidean $n$-space $E^n$ into itself, $A$ and $C$ are real
$n \times n$ matrices and $B$ is an element of $E^n$. Such equations arise in the
dc analysis of transistor networks, the computation of the transient
response of transistor networks, and the numerical solution of certain
nonlinear partial-differential equations.

In Ref. 3 a nonuniqueness theorem is proved which focuses attention 
on a simple special property of transistor-type nonlinearities. It shows 
that for any transistor-type exponential $F(\cdot)$ the equation
$F(x) + Ax = B$ has at least two solutions $x$ for some $B \in E^n$ whenever $A \notin P_0$.
The theorem shows that some earlier conditions’'’ for the existence of a 
unique solution cannot be improved by taking into account more
information concerning the nonlinearities, and therefore makes more
clear that the set of matrices $P_0$ plays a basic role in the theory of
nonlinear transistor networks. Ref. 3 also contains material concerned
with the convergence of algorithms for computing the solution of
$F(x) + Ax = B$ as well as of more general equations, and some related
problems concerning the numerical integration of the ordinary differential
equations which govern the transient response of nonlinear
transistor networks are considered briefly.

The primary purpose of this paper is to present the results of a 
continuation of the numerical integration study initiated in Ref. 3. 
Here we further exploit the special property of transistor-type exponen- 
tial nonlinearities used in Ref. 3. 

We consider in detail the nonlinear equations encountered at each 
time step when certain implicit numerical-integration algorithms are 
used, and, in terms of only the properties of the Jacobian matrix of the 
pertinent set of differential equations, we present necessary and suffi- 
cient conditions for the existence and uniqueness of the solution of the 
nonlinear equations for all continuous forcing functions and any given 
step size. Since engineers often think about dynamic nonlinear transistor 
network problems in terms of the location of the eigenvalues of the 
relevant Jacobian matrix, the results described in Section 2.2 are of 
immediate conceptual value. In particular, these results are of a very 
different character than those that appear in the literature, and when- 
ever the conditions presented are satisfied, it is possible to carry out 
the algorithms. Under the assumption that the conditions are satisfied, 
we also show that there are convergent algorithms for solving the non- 
linear equations, and that the Jacobian matrix of the nonlinear equa- 
tions is essentially always at least weakly well-conditioned in a sig- 
nificant sense. 

A part of Section 2.3 reports on a general result concerning conditions
under which it is possible to invert nonlinear mappings in $E^n$. More
explicitly, we show that a proposition proved by G. H. Meyer enables
us to give a short proof of a new theorem which is a considerably
stronger result than that described and used in Ref. 11.

We also present a set of results concerning properties of an important 
class of transistor-diode networks for which certain implicit numerical- 
integration algorithms can be carried out for all values of the step size, 
and, for a special but significant and useful numerical-integration for- 
mula, theorems are proved concerning some properties of the computed 
sequence such as the extent to which the sequence is relatively immune 
to small local errors introduced at each step as a result of the fact that 
it is ordinarily not possible to compute the solution of a certain equation 
exactly. 

Finally, in addition to other results, we present new theorems concerning
the existence of solutions of the nonlinear dc equation under
very realistic assumptions from the viewpoint of models often used
in computer simulations.†

Section II contains a detailed discussion of the results and their 
significance. 


II. TRANSIENT RESPONSE OF TRANSISTOR-DIODE NETWORKS AND IMPLICIT
NUMERICAL-INTEGRATION FORMULAS


2.1 Introduction 


We shall consider explicitly only networks containing transistors, 
diodes, and resistors. However, the material to be presented can be 
extended to take into account other types of elements as well. In addi- 
tion, we shall focus attention on the use of linear multipoint integration 
formulas of closed (i.e., of implicit) type, since such formulas are of 
considerable use in connection with the typically ‘‘stiff systems” of 
differential equations encountered. 

A very large class of networks containing resistors, transistors, and
diodes modeled in a standard manner is governed by the equation‡

$$\frac{du}{dt} + TF[C^{-1}(u)] + \tilde{G}C^{-1}(u) = B(t), \qquad t \ge 0 \tag{1}$$


† Results concerning the dc equation are directly relevant to the problem of
computing the transient response to the extent that in order to numerically integrate
the differential equations it is ordinarily necessary to first solve a dc problem to
determine the initial conditions.

‡ As a practical matter, the models of transistors and diodes employed here are
often used in computer simulations. Of course in some cases it is necessary to use
more complicated models.
with $\tilde{G} = G(I + RG)^{-1}$ and where, assuming that there are $q$ diodes
and $p$ transistors,

(i) $T = T_1 \oplus T_2 \oplus \cdots \oplus T_p \oplus I_q$, the direct sum of the identity
matrix of order $q$ and $p$ $2 \times 2$ matrices $T_k$ in which

$$T_k = \begin{bmatrix} 1 & -\alpha_r^{(k)} \\ -\alpha_f^{(k)} & 1 \end{bmatrix}$$

with $0 < \alpha_f^{(k)} < 1$ and $0 < \alpha_r^{(k)} < 1$ for $k = 1, 2, \ldots, p$.

(ii) $R = R_1 \oplus R_2 \oplus \cdots \oplus R_p \oplus R_0$, the direct sum of a diagonal
matrix $R_0 = \mathrm{diag}\,(r_1, r_2, \ldots, r_q)$ with $r_k \ge 0$ for $k = 1, 2, \ldots, q$
and $p$ $2 \times 2$ matrices $R_k$ in which for all $k = 1, 2, \ldots, p$

$$R_k = \begin{bmatrix} r_e^{(k)} + r_b^{(k)} & r_b^{(k)} \\ r_b^{(k)} & r_c^{(k)} + r_b^{(k)} \end{bmatrix}$$

with $r_e^{(k)} \ge 0$, $r_b^{(k)} \ge 0$, and $r_c^{(k)} \ge 0$. (The matrix $R$ takes into
account the presence of bulk resistance in series with the diodes
and the emitter, base, and collector leads of the transistors.)

(iii) $G$ is the short-circuit conductance matrix associated with the resistors
of the network. (It does not take into account the bulk
resistances of the semiconductor devices.)

(iv) $F(\cdot)$ is a mapping of $E^{(2p+q)}$ into $E^{(2p+q)}$ defined by the condition
that

$$F(x) = [f_1(x_1), f_2(x_2), \ldots, f_{(2p+q)}(x_{(2p+q)})]^{\mathrm{tr}}$$

for all $x \in E^{(2p+q)}$ with each $f_j(\cdot)$ a continuously-differentiable
mapping of $E^1$ into $E^1$ such that $f_j'(\alpha) > 0$ for all $\alpha \in E^1$.

(v) $C^{-1}(\cdot)$ is the inverse of the mapping $C(\cdot)$, of $E^{(2p+q)}$ into itself,
defined by

$$C(x) = cx + \tau F(x)$$

for all $x \in E^{(2p+q)}$ with $c = \mathrm{diag}\,(c_1, c_2, \ldots, c_{(2p+q)})$, $\tau = \mathrm{diag}\,(\tau_1,
\tau_2, \ldots, \tau_{(2p+q)})$, and with each $\tau_j$ and each $c_j$ a positive constant.

(vi) $B(t)$ is a $(2p + q)$-vector which takes into account the voltage and
current generators present in the network, and

(vii) $u$ is related to $v$, the vector of ideal-junction voltages of the semiconductor
devices ($v$ does not take into account the voltage drops
across the bulk resistors), through $C(v) = u$ for all $v \in E^{(2p+q)}$.

Equation (1) is equivalent to†

$$\dot{u} + f(u, t) = \theta_{(2p+q)}, \qquad t \ge 0 \tag{2}$$

in which of course

$$f(u, t) = TF[C^{-1}(u)] + \tilde{G}C^{-1}(u) - B(t) \tag{3}$$

and $\theta_{(2p+q)}$ is the zero vector of order $(2p + q)$.

† In Ref. 5 it is shown that if $B(\cdot)$ is a continuous mapping of $[0, \infty)$ into $E^{(2p+q)}$, then
for any initial condition $u_0 \in E^{(2p+q)}$ there exists a unique continuous $(2p + q)$-vector-valued
function $u(\cdot)$ such that $u(0) = u_0$ and (1) is satisfied for all $t > 0$.

It is well known that certain specializations of the general multipoint 
formula’” 


$$y_{n+1} = \sum_{k=0}^{r} a_k y_{n-k} + h \sum_{k=-1}^{r} b_k \dot{y}_{n-k} \tag{4}$$

in which

$$\dot{y}_{n-k} = -f(y_{n-k}, (n - k)h) \tag{5}$$

can be used as a basis for computing the solution of equation (2). Here
$h$, a positive number, is the step size, the $a_k$ and the $b_k$ are real numbers,
and of course $y_n$ is the approximation to $u(nh)$ for $n \ge 1$.

In the literature dealing with formulas of the type (4) in connection
with systems of equations of the type (2), information concerning the
location of the eigenvalues of the Jacobian matrix $J_u$ of $f(u, t)$ with respect
to $u$ plays an important role in determining whether or not a given
formula will be (in some suitable sense) stable. In particular, an assumption
often made is that all of the eigenvalues of $J_u$ lie in the strict
right-half plane for all $t \ge 0$ and all $u$. For $f(u, t)$ given by equation (3),
we have

$$
J_u = T\,\mathrm{diag}\left\{ \frac{f_j'[g_j(u_j)]}{c_j + \tau_j f_j'[g_j(u_j)]} \right\}
+ \tilde{G}\,\mathrm{diag}\left\{ \frac{1}{c_j + \tau_j f_j'[g_j(u_j)]} \right\} \tag{6}
$$

in which for $j = 1, 2, \ldots, (2p + q)$, $g_j(u_j)$ is the $j$th component of $C^{-1}(u)$.
Thus here $J_u$ is a matrix of the form

$$TD_1 + \tilde{G}D_2 \tag{7}$$

where $D_1$ and $D_2$ are diagonal matrices with positive diagonal elements.
A simple result concerning (7), Theorem 4 of Ref. 3, asserts that if there
exists a diagonal matrix $D$ with positive diagonal elements such that†


(i) $DT$ is strongly column-sum dominant, and
(ii) $DG$ is weakly column-sum dominant,

then for all diagonal matrices $D_1$ and $D_2$ with positive diagonal elements,
all eigenvalues of (7) lie in the strict right-half plane. This condition
on $T$ and $G$ is often satisfied.†

† The terms “strongly column-sum dominant” and “weakly column-sum dominant”
are reasonably standard. However, they are defined in Section III.

The subclass of numerical integration formulas (4) defined by the 
condition that b_, > 0 are of considerable use’ *° in applications involv- 
ing the typically “stiff systems” of differential equations encountered 
in the analysis of nonlinear transistor networks. With $b_{-1} > 0$, $y_{n+1}$ is
defined implicitly through

$$y_{n+1} + hb_{-1}f(y_{n+1}, (n + 1)h) = \sum_{k=0}^{r} a_k y_{n-k} + h \sum_{k=0}^{r} b_k \dot{y}_{n-k}$$

in which the right side depends on $y_{n-k}$ only for $k \in \{0, 1, 2, \ldots, r\}$, and
for $f(u, t)$ given by equation (3), we have

$$y_{n+1} + hb_{-1}\{TF[C^{-1}(y_{n+1})] + \tilde{G}C^{-1}(y_{n+1})\} = q_n \tag{8}$$

in which

$$q_n = \sum_{k=0}^{r} a_k y_{n-k} + h \sum_{k=0}^{r} b_k \dot{y}_{n-k} + hb_{-1}B[(n + 1)h].$$

Obviously, the numerical integration formula (8) makes sense only if
there exists for each $n$ a $y_{n+1} \in E^{(2p+q)}$ such that (8) is satisfied.
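To make equation (8) concrete, here is a minimal sketch of one implicit step for the simplest possible network: a single diode with no transistors ($p = 0$, $q = 1$, $T = 1$) and no bulk resistance ($R = 0$, so that $\tilde{G} = G$). It uses the backward-Euler specialization of (4), namely $a_0 = 1$, $b_{-1} = 1$ and all other coefficients zero, so that $q_n = y_n + hB[(n+1)h]$. All element values and function names are hypothetical. Because the left side of (8) is strictly increasing in the junction voltage $v = C^{-1}(y_{n+1})$ in this scalar case, plain bisection is used; that is a convenience of the sketch, not an algorithm taken from the paper.

```python
import math

# Hypothetical element values for one diode (p = 0, q = 1, R = 0, so T = 1).
A_SAT, B_EXP = 1e-12, 40.0   # f(v) = A_SAT * (exp(B_EXP * v) - 1)
C_LIN, TAU = 1e-9, 1e-9      # C(v) = C_LIN * v + TAU * f(v)
G_COND = 1e-3                # conductance G seen by the junction
H_STEP, B_M1 = 1e-7, 1.0     # step size h and coefficient b_{-1}


def diode_current(v):
    return A_SAT * (math.exp(B_EXP * v) - 1.0)


def C(v):
    return C_LIN * v + TAU * diode_current(v)


def implicit_lhs(v):
    """Left side of (8), written in terms of v = C^{-1}(y_{n+1})."""
    return C(v) + H_STEP * B_M1 * (diode_current(v) + G_COND * v)


def backward_euler_step(y_n, b_next):
    """Return (y_{n+1}, v_{n+1}) for a_0 = 1, b_{-1} = 1, all other coefficients zero."""
    q_n = y_n + H_STEP * B_M1 * b_next
    lo, hi = -1.0, 1.0
    while implicit_lhs(lo) > q_n:     # widen the bracket downward if needed
        lo *= 2.0
    while implicit_lhs(hi) < q_n:     # widen the bracket upward if needed
        hi *= 2.0
    for _ in range(200):              # bisection on a strictly increasing function
        mid = 0.5 * (lo + hi)
        if implicit_lhs(mid) < q_n:
            lo = mid
        else:
            hi = mid
    v_next = 0.5 * (lo + hi)
    return C(v_next), v_next


y1, v1 = backward_euler_step(0.0, 1e-3)   # one step from u(0) = 0 with B = 1 mA
print(y1, v1)
```

Solving for $v$ rather than for $y_{n+1}$ avoids having to invert $C(\cdot)$ explicitly; $y_{n+1}$ is recovered afterward as $C(v)$.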


2.2 The Jacobian Matrix $J_u$ and Necessary and Sufficient Conditions for
the Existence of a Unique Solution $y_{n+1}$ of (8) for All $q_n \in E^{(2p+q)}$


Here we shall make the additional assumption that the functions
$f_j(\cdot)$ are such that the mapping $F(\cdot)$ belongs to the set $\mathcal{F}^{(2p+q)}$ defined
in Section 3.1. This assumption is satisfied whenever the $f_j(\cdot)$ are the
usual Ebers-Moll exponential-type nonlinearities. That is, $\mathcal{F}^{(2p+q)}$ contains
all of the mappings $F(\cdot)$ such that for each $j$

$$f_j(x_j) = a_j[\exp (b_j x_j) - 1] \quad \text{or} \quad f_j(x_j) = a_j[1 - \exp (-b_j x_j)]$$

for all $x_j \in E^1$ with $a_j$ and $b_j$ positive constants.

Our first result, Theorem 1 of Section III, is a rather strong result
concerning the relation between properties of the Jacobian matrix $J_u$
and properties of equation (8). Let $\Sigma$ denote the set of all real numbers
$\sigma$ such that $\det (\sigma I + J_u) = 0$ for some $u \in E^{(2p+q)}$. In other words, let
$\Sigma$ denote the set of all real numbers $\sigma$ such that $-\sigma$ is an eigenvalue of
$J_u$ at some point $u$. According to Theorem 1, equation (8) possesses a
unique solution $y_{n+1}$ for each $q_n \in E^{(2p+q)}$ (and hence each $B[(n + 1)h] \in
E^{(2p+q)}$) if and only if $(hb_{-1})^{-1} \notin \Sigma$, and also if $(hb_{-1})^{-1} \in \Sigma$ then equation
(8) possesses at least two solutions for some $q_n \in E^{(2p+q)}$ (and hence for
some $B[(n + 1)h] \in E^{(2p+q)}$). Therefore, in particular, equation (8) possesses
a unique solution for all $q_n \in E^{(2p+q)}$ and all $h \in (0, \bar{h}]$, in which $\bar{h}$
is an arbitrary positive constant, if and only if the intersection of the
interval $[(\bar{h}b_{-1})^{-1}, \infty)$ and $\Sigma$ is the null set, and equation (8) possesses
a unique solution for all $q_n \in E^{(2p+q)}$ and all $h > 0$ if and only if $\Sigma$ contains
no points of the interval $(0, \infty)$. Finally, as a somewhat peripheral
matter, according to Theorem 1, the dc equation $TF(v) + Gv = B$ has
at most one solution $v$ for each $B \in E^{(2p+q)}$ if and only if $0 \notin \Sigma$.

† See Ref. 5 for examples.

The statements made in the preceding paragraph are surprising to
the extent that on the one hand they are rather definitive and on the
other hand they involve only the location of the real eigenvalues of
$J_u$.† Since engineers often find it helpful to think about nonlinear
systems in terms of the location of the eigenvalues of a pertinent
Jacobian matrix, it is also of interest to note here that equation (8) can
possess more than one solution $y_{n+1}$ for some $q_n$ and some $h > 0$ only if
the transistor-diode network is locally exponentially unstable at some
operating point, that is, only if at some operating point $u$, $-J_u$ has a
real positive eigenvalue.
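As a small worked illustration of the eigenvalue criterion of this section, consider the scalar case of a single diode. This is a sketch under assumptions that are not part of the text: $p = 0$, $q = 1$, $T = 1$, $R = 0$ (so that the conductance in equation (8) is just $G$), and $G \ge 0$.

```latex
% One diode, no transistors: p = 0, q = 1, T = 1, R = 0, G >= 0 (assumed).
\[
  f(u,t) = F\bigl(C^{-1}(u)\bigr) + G\,C^{-1}(u) - B(t), \qquad
  C(v) = c\,v + \tau F(v), \quad c > 0,\ \tau > 0 .
\]
% With v = C^{-1}(u), the chain rule gives the scalar Jacobian
\[
  J_u = \frac{F'(v) + G}{c + \tau F'(v)} > 0 \quad \text{for all } u ,
\]
% so that \sigma + J_u = 0 forces \sigma = -J_u < 0, i.e.
\[
  \Sigma \subset (-\infty, 0), \qquad \Sigma \cap (0, \infty) = \emptyset .
\]
```

Under these assumptions the set $\Sigma$ contains no point of $(0, \infty)$, so the criterion quoted from Theorem 1 gives a unique solution of equation (8) for every step size $h > 0$; multiple solutions can only arise in networks where $-J_u$ acquires a positive real eigenvalue at some operating point.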


2.3 Existence of Convergent Algorithms for Computing the Solution of (8) 


Throughout this section we assume that the $f_j(\cdot)$ are such that the
additional condition that $F(\cdot) \in \mathcal{F}^{(2p+q)}$ is satisfied.

Whenever $(hb_{-1})^{-1}$ is not contained in the set $\Sigma$ of Section 2.2, equation
(8), which we shall write as $Q(y_{n+1}) = q_n$, possesses a unique solution
$y_{n+1}$ for any $q_n \in E^{(2p+q)}$. We show here that when $(hb_{-1})^{-1} \notin \Sigma$ and each
$f_j(\cdot)$ is twice continuously differentiable on $E^1$,‡ there exist steepest
descent as well as Newton-type algorithms each of which generates a
sequence in $E^{(2p+q)}$ which converges to $y_{n+1}$.

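Assume that $(hb_{-1})^{-1} \notin \Sigma$. The Jacobian matrix $(I + hb_{-1}J_{y_{n+1}})$ of $Q(\cdot)$
satisfies

$$\det (I + hb_{-1}J_{y_{n+1}}) \neq 0 \quad \text{for all } y_{n+1} \in E^{(2p+q)}. \tag{9}$$

Hence $Q(\cdot)$ is a local homeomorphism on $E^{(2p+q)}$ and since there exists
a unique $y_{n+1} \in E^{(2p+q)}$ such that $Q(y_{n+1}) = q_n$ for each $q_n \in E^{(2p+q)}$, $Q(\cdot)$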


† Indeed, while we can write (8) as $Q(y_{n+1}) = q_n$ with $Q(\cdot)$ a continuously-differentiable
mapping of $E^{(2p+q)}$ into itself with Jacobian matrix $(I + hb_{-1}J_{y_{n+1}})$, recall
that for $R(\cdot)$ a general continuously-differentiable mapping of $E^n$ into itself with
Jacobian matrix $J$, $\det J \neq 0$ throughout $E^n$ does not imply that (and is not implied
by the statement that) for each $x \in E^n$ there exists a unique $y \in E^n$ such that
$R(y) = x$, even for $n = 1$.

‡ This differentiability condition is obviously satisfied if the $f_j(\cdot)$ are the usual
exponential functions.
is a homeomorphism of $E^{(2p+q)}$ onto itself. Thus, with $\|\cdot\|$ any norm
on $E^{(2p+q)}$, $\|Q(y)\| \to \infty$ as $\|y\| \to \infty$.†

Let $R(\cdot)$ be defined by the condition that $R(y) = Q(y) - q_n$ for all
$y \in E^{(2p+q)}$. Then $R(\cdot)$ satisfies $\|R(y)\| \to \infty$ as $\|y\| \to \infty$ and the
determinant of the Jacobian matrix of $R(\cdot)$ does not vanish throughout
$E^{(2p+q)}$. Therefore, assuming that $R(\cdot)$ is twice continuously differentiable
on $E^{(2p+q)}$, it follows (see the Appendix) that the solution $y_{n+1}$ of
$R(y_{n+1}) = \theta_{(2p+q)}$ can be computed by using certain steepest descent or
Newton-type algorithms.
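One simple member of the Newton-type family is a Newton iteration whose step is halved until the residual norm decreases. The sketch below applies it to the scalar single-diode case ($p = 0$, $q = 1$, $T = 1$, $R = 0$), working in the junction-voltage coordinate $v$. It is only an illustration under those assumptions: the element values, the function names, and the particular damping rule are hypothetical and are not claimed to be the algorithms of the Appendix.

```python
import math

# Hypothetical element values for a single diode (p = 0, q = 1, R = 0, T = 1).
A_SAT, B_EXP = 1e-12, 40.0   # f(v) = A_SAT * (exp(B_EXP * v) - 1)
C_LIN, TAU = 1e-9, 1e-9      # C(v) = C_LIN * v + TAU * f(v)
G_COND = 1e-3                # conductance G
H_B = 1e-7                   # the product h * b_{-1}


def residual(v, q_n):
    """R in the junction-voltage coordinate v: left side of (8) minus q_n."""
    f = A_SAT * (math.exp(B_EXP * v) - 1.0)
    return C_LIN * v + TAU * f + H_B * (f + G_COND * v) - q_n


def residual_slope(v):
    """dR/dv; strictly positive, so the Newton step is always defined."""
    df = A_SAT * B_EXP * math.exp(B_EXP * v)
    return C_LIN + TAU * df + H_B * (df + G_COND)


def damped_newton(q_n, v0=0.0, rtol=1e-12, max_iter=100):
    """Newton iteration whose step is halved until |R| decreases."""
    v = v0
    tol = rtol * max(abs(q_n), 1e-30)
    for _ in range(max_iter):
        r = residual(v, q_n)
        if abs(r) <= tol:
            break
        step = -r / residual_slope(v)
        t = 1.0
        while t > 1e-12:
            try:
                if abs(residual(v + t * step, q_n)) < abs(r):
                    break
            except OverflowError:
                pass  # trial point too far out for exp(); shrink the step
            t *= 0.5
        v += t * step
    return v


v_small = damped_newton(1e-10)   # nearly linear regime
v_large = damped_newton(1e-8)    # exponential regime; the damping is exercised here
print(v_small, v_large)
```

The damping matters because the exponential makes a full Newton step from the low-voltage side overshoot badly; halving the step until the residual shrinks keeps every accepted iterate an improvement.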


2.4 The Jacobian Matrix $(I + hb_{-1}J_{y_{n+1}})$, and Inversion of Nonlinear
Operators on $E^n$ and Jacobian Matrices


As in Section 2.3, let the additional condition that $F(\cdot) \in \mathcal{F}^{(2p+q)}$ be
satisfied and let $Q(\cdot)$ be the mapping of $E^{(2p+q)}$ into itself with the property
that equation (8) can be written as $Q(y_{n+1}) = q_n$. According to
Theorem 2 of Section III the Jacobian matrix $(I + hb_{-1}J_{y_{n+1}})$ possesses
the property that there exists a constant $\epsilon > 0$ such that

$$\det (I + hb_{-1}J_{y_{n+1}}) \ge \epsilon \quad \text{for all } y_{n+1} \in E^{(2p+q)} \tag{10}$$

if and only if the matrix

$$[(hb_{-1})^{-1}\tau + T]^{-1}[(hb_{-1})^{-1}c + \tilde{G}],$$

which we shall call $S$, belongs to the set $P$ of all real square matrices
each with all principal minors positive. Thus when $S \in P$ the matrix
$(I + hb_{-1}J_{y_{n+1}})$ is well conditioned in at least the weak sense of (10).
This fact is of some interest for two reasons. First, certain standard
algorithms require that the matrix $(I + hb_{-1}J_{y_{n+1}})$ be inverted along a
sequence of points $\{y_{n+1}\}$ in order to compute the solution $y_{n+1}$ of equation
(8), and, secondly, Theorem 3 of Section III shows that if
$\det [(hb_{-1})^{-1}I + J_u] \neq 0$ for all $u \in E^{(2p+q)}$ and all $(hb_{-1})^{-1} \in \mathcal{J}'$ in which
$\mathcal{J}'$ denotes either $(0, \infty)$ or any interval contained in $(0, \infty)$, then $S \in P$
for all but at most a finite number of points $(hb_{-1})^{-1}$ contained in $\mathcal{J}'$.
Therefore, referring to the material of Section 2.2, if $Q(y_{n+1}) = q_n$ possesses
a unique solution $y_{n+1}$ for all $q_n \in E^{(2p+q)}$ and all $(hb_{-1})^{-1} \in \mathcal{J}'$, then
$(I + hb_{-1}J_{y_{n+1}})$ is at least weakly well conditioned at all but at most a
finite number of points contained in $\mathcal{J}'$.


† Since $Q(\cdot)$ is a homeomorphism of $E^{(2p+q)}$ onto itself, $Q(\cdot)^{-1}$ exists and is continuous.
Therefore, the image of any closed ball in $E^{(2p+q)}$ under $Q(\cdot)^{-1}$ is contained in
some closed ball in $E^{(2p+q)}$, and hence $\|Q(y)\| \to \infty$ as $\|y\| \to \infty$.
Since the elements of $(I + hb_{-1}J_{y_{n+1}})$ are bounded on $y_{n+1} \in E^{(2p+q)}$, it
follows from a theorem described by M. Vehovec$^{11}$ that for each $q_n \in
E^{(2p+q)}$ there exists a unique $y_{n+1} \in E^{(2p+q)}$ such that $Q(y_{n+1}) = q_n$ if $S \in P$.
More explicitly, the theorem described* by Vehovec asserts that if $R(\cdot)$ is
a continuously-differentiable mapping of $E^n$ into $E^n$ with $J(R)_q$ the Jacobian
matrix of $R(\cdot)$ at an arbitrary point $q \in E^n$, if the elements of
$J(R)_q$ are bounded on $E^n$, and if there exists a positive constant $\epsilon$ such
that $\det J(R)_q \ge \epsilon$ for all $q \in E^n$, then $R(\cdot)$ is a homeomorphism. Thus,
using the theorem of Ref. 11 and Theorems 2 and 3 of Section III,
we are able to show that if $\det [(hb_{-1})^{-1}I + J_u] \neq 0$ for all $u \in E^{(2p+q)}$
and all $(hb_{-1})^{-1} \in \mathcal{J}'$, then for all but at most a finite number of points
$(hb_{-1})^{-1} \in \mathcal{J}'$, (8) possesses a unique solution $y_{n+1}$ for each $q_n \in E^{(2p+q)}$.
Although this result is obviously much weaker than the existence
proposition presented in Section 2.2, it shows that the theorem of
Ref. 11 can be exploited to provide some insight in connection with
the specific problem considered here.

The theorem of Ref. 11 is of interest primarily because the key hypothesis
concerns only the determinant $\det J(R)_q$ (as opposed to the
condition of Palais‡ that $\|R(q)\| \to \infty$ as $\|q\| \to \infty$). Theorem 4 of
Section III is a general result which is considerably stronger than the
theorem of Ref. 11. It shows that the condition of the theorem of Ref.
11 that there exist a positive constant $\epsilon$ such that $\det J(R)_q \ge \epsilon$ for all
$q$ can be replaced with the condition that there exist real constants $a > 0$
and $b \ge 0$ such that

$$\det J(R)_q \ge \frac{1}{a + b\|q\|} \quad \text{for all } q \in E^n.$$

2.5 A Class of Networks for Which (8) Possesses a Unique Solution for 
All Values of the Step Size 


There is an interesting class of transistor-diode-resistor networks
with the property that for each network in the class, equation (8) possesses
a unique solution for all $h > 0$ (i.e., for all $h > 0$, all $q_n \in E^{(2p+q)}$,
and all diagonal matrices $c$ and $\tau$ with positive diagonal elements). In
order to define and discuss that class, consider the dc equation
$TF(v) + Gv = B$ in which $v$ is the $(2p + q)$-vector of semiconductor ideal-junction
voltages and $B \in E^{(2p+q)}$. If $p > 0$ and the matrix $R$ of Section 2.1 is
the zero matrix, $v_1$ is the emitter-to-base voltage of transistor one, $v_2$
is the collector-to-base voltage of transistor one, and so forth. By port
$j$ of the transistor-diode-resistor network we mean the terminal pair
between which the voltage $v_j$ appears. Again we shall make the assumption
that $F(\cdot) \in \mathcal{F}^{(2p+q)}$.

* According to Vehovec, the theorem was recently proved by I. Vidar, and the
proof is expected to appear in the journal Glasnik Matematicki.

‡ See Ref. 12 and the appendix of Ref. 18. Here $\|\cdot\|$ denotes any norm on $E^n$.

In Ref. 3 it is proved that $TF(v) + Gv = B$ possesses at most one
solution $v$ for each $B \in E^{(2p+q)}$ if and only if $T^{-1}G \in P_0$. It is also proved
in Ref. 3 that equation (8) possesses a unique solution $y_{n+1}$ for each
$q_n \in E^{(2p+q)}$ and each $h > 0$ if $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$ in which here $\mathcal{T}(T)$
denotes the set of all real matrices having the same form as $T$ and with
the “$\alpha$’s” of $M$ not larger than those of $T$.† In other words, it was also
proved in Ref. 3 that equation (8) possesses a unique solution $y_{n+1}$ for
each $q_n \in E^{(2p+q)}$ and each $h > 0$ if the dc equation possesses at most
one solution for each $B \in E^{(2p+q)}$ for “the original set of $\alpha$’s as well as
for an arbitrary set of not-larger $\alpha$’s.” Before proceeding, and for the
sake of completeness, we mention here that the same result can be obtained
by way of the approach of Section 2.2; a direct corollary of Theorem
5 of Section III, Corollary 1, shows that if $M^{-1}G \in P_0$ for all
$M \in \mathcal{T}(T)$, then $\det (\sigma I + J_u) \neq 0$ for all real $\sigma \ge 0$ and all $u \in E^{(2p+q)}$.

Theorem 5 of Section III provides considerable information concerning
the nature of the class of networks for which $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$.
In particular, the theorem shows that $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$
if and only if $M^{-1}G \in P_0$ for all $M \in \mathcal{T}_0(T)$, in which $\mathcal{T}_0(T)$ is the set of
all $2^{2p}$ real square matrices $M$ having the same form as $T$ and with
each "$\alpha$" of $M$ either zero or the corresponding "$\alpha$" of $T$.‡ The theorem
also shows that "$M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$" is equivalent to
each of six other statements involving $T$ and $G$. For example, according
to Theorem 5, we have $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$ if and only if either
$T^{-1}(G + D) \in P_0$ for all diagonal matrices $D$ with positive diagonal
elements, which has an obvious network interpretation in terms of the
addition of resistors to the network characterized by $G$, or $T^{-1}G \in P_0$
and $(T_w)^{-1}G_w \in P_0$ for all pairs of matrices $T_w$ and $G_w$, obtained from $T$
and $G$, respectively, by deleting an arbitrary set $w$ of rows, and the same
set of columns, of both $T$ and $G$.
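
The finite test over $\mathcal{T}_0(T)$ described above lends itself to a direct computation. The following is only a minimal sketch under stated assumptions: the values of the alphas, the matrix `G`, and all function names are hypothetical illustrations, and the block form $[[1, -\alpha_r], [-\alpha_f, 1]]$ assumed for $T$ is the one given in Definition 3 of Section III. Exact rational arithmetic is used so that the sign tests on principal minors are not affected by rounding.

```python
from fractions import Fraction
from itertools import combinations, product

def det(m):
    """Exact determinant by Gaussian elimination (det of the 0x0 matrix is 1)."""
    a = [[Fraction(x) for x in row] for row in m]
    n, d = len(a), Fraction(1)
    for c in range(n):
        p = next((r for r in range(c, n) if a[r][c] != 0), None)
        if p is None:
            return Fraction(0)
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            a[r] = [x - f * y for x, y in zip(a[r], a[c])]
    return d

def inverse(m):
    """Exact inverse by Gauss-Jordan elimination; assumes m is nonsingular."""
    n = len(m)
    a = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(m)]
    for c in range(n):
        p = next(r for r in range(c, n) if a[r][c] != 0)
        a[c], a[p] = a[p], a[c]
        piv = a[c][c]
        a[c] = [x / piv for x in a[c]]
        for r in range(n):
            if r != c:
                f = a[r][c]
                a[r] = [x - f * y for x, y in zip(a[r], a[c])]
    return [row[n:] for row in a]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def in_P0(m):
    """True if every principal minor of m is nonnegative."""
    n = len(m)
    return all(det([[m[i][j] for j in idx] for i in idx]) >= 0
               for k in range(1, n + 1) for idx in combinations(range(n), k))

def build(alphas, q):
    """Direct sum of the blocks [[1, -a_r], [-a_f, 1]] and the identity of order q."""
    n = 2 * len(alphas) + q
    m = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    for k, (a_r, a_f) in enumerate(alphas):
        m[2 * k][2 * k + 1] = -Fraction(a_r)
        m[2 * k + 1][2 * k] = -Fraction(a_f)
    return m

def T0_family(alphas, q):
    """All matrices whose alphas are either zero or the corresponding alpha of T."""
    flat = [a for pair in alphas for a in pair]
    for mask in product((0, 1), repeat=len(flat)):
        kept = [a * s for a, s in zip(flat, mask)]
        yield build(list(zip(kept[0::2], kept[1::2])), q)

alphas = [(Fraction(1, 2), Fraction(9, 10))]   # hypothetical (alpha_r, alpha_f), p = 1
G = [[2, -1], [-1, 2]]                          # hypothetical conductance matrix, q = 0
family = list(T0_family(alphas, q=0))
results = [in_P0(matmul(inverse(M), G)) for M in family]
print(len(family), all(results))
```

For this particular choice every member of the family passes, which is in keeping with the statement that the class includes all networks with $p = 1$; the sketch does not, of course, replace the proof.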

† See Definition 4 of Section III for a precise definition of $\mathcal{T}(T)$.
‡ See Definition 5 of Section III for a precise definition of $\mathcal{T}_0(T)$.

When the matrix $R$ of Section 2.1 is the zero matrix, the last condition
on $T$ and $G$ of the preceding paragraph also has a simple network
interpretation: Given $T$ and $G$, we have $T^{-1}G \in P_0$, and any network
obtained from the network characterized by $T$ and $G$ by short-circuiting
an arbitrary set $w$ of at most all but one of the $(2p + q)$ semiconductor
junctions possesses the following property. With respect to the voltage
vector $v_w$ associated with the junctions not short-circuited, and with
the components of $v_w$ taken in the same order as those of $v$, the "new
$T$ and $G$" matrices† $T_w$ and $G_w$ satisfy $(T_w)^{-1}G_w \in P_0$. As reasonable
as this condition or any of the other seven equivalent conditions of
Theorem 5 might seem, and even though, as Theorem 6 of Section III
shows, $T^{-1}G \in P_0$ implies that $(T_w)^{-1}G_w \in P_0$ whenever $w$ has the property
that if the port number associated with one junction of a given transistor
is contained in $w$, then the port number associated with the other
junction of that transistor is also contained in $w$, it is the case that
there are transistor-diode-resistor networks for which $T^{-1}G \in P_0$ and
$M^{-1}G \notin P_0$ for some $M \in \mathcal{T}(T)$. In fact, Ref. 14 presents an example in
which $p = 3$, $q = 0$, $T^{-1}G \in P_0$, and $T^{-1}(G + D) \notin P_0$ for some diagonal
matrix $D$ with positive diagonal elements. However, the class of networks
for which $T^{-1}G \in P_0$ implies that $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$ is clearly
quite large; it obviously includes all networks in which $p = 0$, it includes
all networks in which the base terminals of all transistors are connected
to a common point, and as Theorem 7 of Section III shows, the class
includes all networks in which $T^{-1}G \in P_0$ and $p = 1$ or $p = 2$ (see Ref. 14).††


2.6 Results Concerning the Numerical-Integration Formula $y_{n+1} = y_n + h\dot{y}_{n+1}$

The general multipoint formula (4) reduces to the well-known
implicit numerical-integration formula $y_{n+1} = y_n + h\dot{y}_{n+1}$ when $a_0 =
b_{-1} = 1$, $b_0 = 0$, and $a_k = b_k = 0$ for $k = 1, 2, \ldots, r$. For that important
special case, and with $\dot{y}_{n+1}$ given by equations (3) and (5), $\{y_{n+1}\}$ is
defined implicitly through

$$ y_{n+1} + h\{TF[C^{-1}(y_{n+1})] + GC^{-1}(y_{n+1})\} = y_n + hB_n \tag{11} $$

for all $n \geq 0$, in which $B_n = B[(n + 1)h]$. Here we describe some detailed
results concerning the relation between the sequences $\{y_{n+1}\}$ and $\{B_n\}$.
We assume throughout this section that $G$ is such that there exists a
diagonal matrix $D$ with positive diagonal elements with the property
that both $DT$ and $DG$ are strongly column-sum dominant. This condition,
which is often satisfied,§ guarantees that there exists a unique
solution‡ $y_{n+1}$ of equation (11) for each $(y_n + hB_n) \in E^{(2p+q)}$.
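
To make the implicit step of equation (11) concrete, here is a rough sketch for the simplest possible case: a single diode ($p = 0$, $q = 1$, so $T = 1$) with the assumed exponential characteristic $f(x) = e^x - 1$. The parameter values, the function name `backward_euler`, and the use of Newton's method for the per-step solve are all illustrative assumptions; the paper only says that some iterative algorithm is typically employed. With $C(x) = cx + \tau f(x)$ and $x = C^{-1}(y_{n+1})$, each step solves a scalar monotone equation in $x$.

```python
import math

def backward_euler(y0, b, h, steps, c=1.0, tau=1.0, g=1.0):
    """Scalar instance of equation (11) with T = 1 and f(x) = exp(x) - 1.

    The unknown at each step is x = C^{-1}(y_{n+1}), where C(x) = c*x + tau*f(x).
    Returns the list of pairs (y_{n+1}, x_{n+1}).
    """
    f = lambda x: math.exp(x) - 1.0
    df = lambda x: math.exp(x)
    x, y, out = 0.0, y0, []
    for n in range(steps):
        rhs = y + h * b((n + 1) * h)          # y_n + h*B_n with B_n = B[(n+1)h]
        for _ in range(100):                  # Newton iteration on phi(x) = 0
            phi = c * x + tau * f(x) + h * (f(x) + g * x) - rhs
            dphi = c + tau * df(x) + h * (df(x) + g)
            step = phi / dphi
            x -= step
            if abs(step) < 1e-14:
                break
        y = c * x + tau * f(x)                # y_{n+1} = C(x)
        out.append((y, x))
    return out

# Constant forcing B(t) = 1: the iterates should settle where f(x) + g*x = B.
traj = backward_euler(y0=0.0, b=lambda t: 1.0, h=0.1, steps=500)
y_last, x_last = traj[-1]
residual = (math.exp(x_last) - 1.0) + x_last - 1.0
print(abs(residual) < 1e-9)
```

Because the step equation is strictly increasing and convex in $x$ for these assumed parameters, Newton's method is well behaved here; for a general network the per-step solve is a vector problem and other iterations may be preferable.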


† It is a simple matter to show that the "new $T$ and $G$" matrices are $T_w$ and $G_w$.

†† It is proved in Ref. 14 that if $q = 0$ and if $p = 1$ or $p = 2$, then $T^{-1}G \in P_0$
implies that $T^{-1}(G + D) \in P_0$ for all diagonal matrices $D$ with positive diagonal
elements. Thus, by the equivalence of statements (i) and (v) of Theorem 5 of Section
III, it follows at once that if $T^{-1}G \in P_0$ then $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$ if $q = 0$
and $p = 1$ or $p = 2$. The proof of essentially the same end result given here is of a
very different nature and is quite short.

§ See Ref. 5 for examples.

‡ A result mentioned in Section 2.1 implies that if $DT$ and $DG$ are both strongly
column-sum dominant, then $\det[(h)^{-1}I + J_u] \neq 0$ for all $u \in E^{(2p+q)}$ and all $h > 0$.

Let $\|\cdot\|_1$ be defined by the condition that $\|v\|_1 = \sum_{j=1}^{(2p+q)} |v_j|$ for all
$v \in E^{(2p+q)}$. According to Theorem 8 of Section III, there exists a positive
constant $\delta$ depending only on the $c_j$, the $\tau_j$, $T$, $G$, and $D$ such that


$$ \|Dy_n\|_1 \leq (1 + \delta h)^{-n}\,\|Dy_0\|_1 + h \sum_{k=0}^{n-1} (1 + \delta h)^{-(n-k)}\,\|DB_k\|_1 $$


for all $n \geq 1$. Therefore, it follows that for all $h > 0$, the sequence
$y_1, y_2, \ldots$ is bounded whenever the sequence $B_0, B_1, \ldots$ is bounded,
and $y_1, y_2, \ldots$ approaches $\theta_{(2p+q)}$, the zero vector of $E^{(2p+q)}$, whenever
$B_0, B_1, \ldots$ approaches $\theta_{(2p+q)}$.


Typically at each step an iterative algorithm is employed to compute
the solution $y_{n+1}$ of equation (11). Since it is ordinarily not possible to
compute $y_{n+1}$ with infinite precision, it is important to consider the
effects of the errors which are introduced. While, ideally, we would like
to determine the sequence $\{y_{n+1}\}$ defined by equation (11) and some
initial-condition vector $y_0$, suppose that we determine instead a sequence
$\{\tilde{y}_{n+1}\}$ such that, with $\epsilon$ an arbitrary positive constant,
$\|D(\tilde{y}_n - y_n^*)\|_1 \leq \epsilon$ for all $n \geq 1$ and

$$ y_{n+1}^* + h\{TF[C^{-1}(y_{n+1}^*)] + GC^{-1}(y_{n+1}^*)\} = \tilde{y}_n + hB_n \tag{12} $$

for all $n \geq 0$. That is, suppose that at each step the local error
$\|D(\tilde{y}_n - y_n^*)\|_1$ in solving for "$y_n$" is at most $\epsilon$. Then, according to
Theorem 8, and with $\delta$ the positive constant referred to above,


|| D(Yn — Gn) [1 < (1+ 6h)” || Diyo — Go) I|, 
+. ri + 6h)" forall n=1 


in which $\tilde{y}_0$ is the approximation to $y_0$. Therefore, given an arbitrarily
small positive constant $\rho$, for any $h > 0$ it is possible to choose $\tilde{y}_0$ and
$\epsilon > 0$ such that the accumulated-error vector $(y_n - \tilde{y}_n)$ satisfies
$\|y_n - \tilde{y}_n\|_1 \leq \rho$ for all $n \geq 1$.

Finally, Theorem 9 of Section III provides us with a conceptually
interesting uniform bound on the norm of the difference between
corresponding elements of the sequences $\{y_n\}$ and $\{u_n\}$ in which $u_n = u(nh)$
for all $n \geq 0$ and $u(\cdot)$ satisfies the differential equation (1). According
to Theorem 9, there exist positive constants $\delta$ and $\rho$, both independent
of $h$, such that

$$ \|D(u_n - y_n)\|_1 \leq (1 + \delta h)^{-n}\,\|D(u_0 - y_0)\|_1 + \rho h $$

for all $n \geq 1$, assuming that the elements of $B(\cdot)$ and $(d/dt)B(\cdot)$ are
bounded and continuous on $[0, \infty)$. In particular, if $y_0 = u_0$ we see that
there exists a positive constant $\rho'$, independent of $h$, such that
$\|u_n - y_n\|_1 \leq \rho' h$ for all $n \geq 1$, provided only that the assumptions of this
section are satisfied and that $B(\cdot)$ and $(d/dt)B(\cdot)$ are bounded and
continuous on $[0, \infty)$.


2.7 Conditions Which Imply That $T^{-1}G(I + RG)^{-1} \in P_0$


In this section and in Section 2.8 we present some results concerning
properties of the dc equation $TF(v) + Gv = B$. These results are directly
relevant to the problem of computing the transient response of
transistor-diode networks to the extent that in order to numerically integrate
the differential equation (1) it is ordinarily necessary to first solve a
dc problem to determine the initial conditions.

As indicated in Section 2.1, 4 = G(I + RG)" in which R takes into 
account the bulk resistances associated with the semiconductor devices. 
Here we present some material concerning conditions which imply
that $T^{-1}G(I + RG)^{-1}$ belongs to $P_0$.

Let $p > 0$. Theorem 10 of Section III asserts that $T^{-1}G(I + RG)^{-1} \in P_0$
whenever $T^{-1}G \in P_0$ and $R$ satisfies


(k) (kK)\-1 (4) _ tk) 
a, (l—a, ) Tr.” = 75 

(k) (k)\—-1 (4) (Kk) 
ay (1 — ) Ts = 7, 


for $k = 1, 2, \ldots, p$. This rather special result shows that if $F(\cdot)$ satisfies
the additional condition that $F(\cdot)$ belongs to the set $\mathcal{F}_0^{(2p+q)}$ defined in
Section 3.1, and if the network associated with $T$ and $G$ possesses the
property that there is at most one solution $v$ of the dc equation $TF(v) +
Gv = B$ for each $B \in E^{(2p+q)}$, then it is always possible to add certain
resistors of positive value in series with each transistor lead such that
the dc equation of the resulting network possesses at most one solution.

Theorem 11 of Section III directs attention to the fact that there is
a nontrivial class of transistor networks for which $T^{-1}G(I + RG)^{-1} \in P_0$
for all $R$. According to Theorem 11, if $p > 0$ and $G$ is such that $T^{-1}G \in P_0$
for all "$\alpha$'s" (i.e., for all $\alpha_r^{(k)}$ and $\alpha_f^{(k)}$ belonging to $(0, 1)$), then for any
particular set of "$\alpha$'s" $T^{-1}G(I + RG)^{-1} \in P_0$ for all $R$.†

Given $T$, an interesting characterization of the class of
short-circuit-conductance matrices $G$ such that $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$ is provided
by Theorem 12 of Section III.‡ According to Theorem 12, $M^{-1}G \in P_0$
for all $M \in \mathcal{T}(T)$ if and only if $T^{-1}G(I + RG)^{-1} \in P_0$ for all $R$ satisfying
certain inequality-type conditions. In particular, if the base-lead
resistance of each transistor is taken to be zero, then $M^{-1}G \in P_0$ for all
$M \in \mathcal{T}(T)$ implies that $T^{-1}G(I + RG)^{-1} \in P_0$ for all nonnegative values
of each emitter-lead resistor and each collector-lead resistor.

† A similar result is proved in Ref. 2 under the assumption that $G$ is not singular.
‡ The set $\mathcal{T}(T)$ is described in Section 2.5.


2.8 Ebers-Moll Models and the Existence of a Solution of TF(v) + Gv = B 


In Section III, a set §; of mappings F(-) is defined such that each 
element of 5; possesses certain important properties possessed by an 
arbitrary F(-) of the type that arises when an Ebers-Moll exponential- 
nonlinear-function model is used for each transistor and diode. In 
contrast with the set of all F(-) such that each f;(-) is a strictly-mono- 
tone-increasing mapping of I’ onto E” , an arbitrary element F(-) of 
$5 possesses the properties that for each j, f;(-) is bounded on either 
[(0, ©) or (— ~, O], and the two nonlinear functions associated with the 
same transistor are both bounded on either [0, ©) or (— ~, O]. The set 
¥5 is contained in §{’?*” and contains every Ebers-Moll exponential- 
nonlinear-function-type F(-). 

The first part of Theorem 13 of Section III asserts that the equation
TF(v) + Gv = B possesses a unique solution v for each F(-) ¢ ¥; and 
each $B \in E^{(2p+q)}$ if and only if $T^{-1}G \in P_0$ and $\det G \neq 0$. It is the "only if"
part of this proposition which is the new result presented here. The
proof exploits some special properties of transformerless resistor
networks; it shows that if $T^{-1}G \in P_0$ but $\det G = 0$, then there are functions
$t(\cdot)$ and $d(\cdot)$, both functions taking on only the values 1 or $-1$, such
that there is no solution $v$ of $TF(v) + Gv = B$ for some $B \in E^{(2p+q)}$ for any
set of Ebers-Moll-modeled transistors and diodes with the property
that for all $k$ transistor $k$ is a pnp device (as opposed to an npn device)
if and only if $t(k) = 1$, and for all $j$ diode $j$ is a p-n junction if and only
if $d(j) = 1$.†

The discussion of the preceding paragraph concerning the proof of
Theorem 13 shows that it is not possible to make stronger assertions
concerning the existence of a unique solution of $TF(v) + Gv = B$ for
all $B \in E^{(2p+q)}$ for Ebers-Moll-modeled transistors and diodes unless
we take into account more information about the nature of the
semiconductor junctions. A good deal of progress in this direction has
recently been made, and we state here without proof the following
complete result dealing with diode-resistor networks.


Theorem 14:' Let p = O and q > 0. Let F(-) ¢ Ss (see Definition 12 of 


t In contrast, the proof of the “only if” part of Theorem 3 of Ref. 1 shows that if 
A ¢P,then there is a mapping F'(- ) with eachf;(-) a linear function such that F(x) + 
Az = B does not possess a unique solution for all B « E*. 

+ The proof of Theorem 14 will be presented in a subsequent paper. 




Section 3.31), and for $j = 1, 2, \ldots, q$ let $s_j$ equal either 1 or $-1$ depending
on whether $f_j(\cdot)$ is bounded on $[0, \infty)$ or $(-\infty, 0]$, respectively. Then, with $A$
any real symmetric nonnegative-definite matrix of order $q$, there exists a
unique solution $v$ of $F(v) + Av = B$ for all $B \in E^q$ if and only if there is no
real $q$-vector $\eta$ such that $\eta \neq \theta_q$, $A\eta = \theta_q$, and $\eta \in S$, in which

$$ S = \{y : y \in E^q \text{ and } y_j s_j \geq 0 \text{ for } j = 1, 2, \ldots, q\}. $$


III. THEOREMS AND PROOFS 


3.1 Notation and Definitions 
Throughout Section III, 


(i) unless stated otherwise, $p$ and $q$ denote nonnegative integers
such that $(p + q) > 0$, and $n$ denotes an arbitrary positive
integer;

(ii) the set of all real $n$-vectors is denoted by $E^n$, $\theta_n$ is the zero element
of $E^n$, and if $v \in E^n$ and $j$ is an integer such that $1 \leq j \leq n$, then
$v_j$ denotes the $j$th component of $v$;

(iii) $\|v\| = (\sum_{j=1}^{n} v_j^2)^{1/2}$ and $\|v\|_1 = \sum_{j=1}^{n} |v_j|$ for all $v \in E^n$; for any
real $n \times n$ matrix $M$, $\|M\|$ denotes $\sup\{\|Mx\| : \|x\| = 1,\ x \in E^n\}$;

(iv) the transpose of an arbitrary (not necessarily square) matrix
$M$ is denoted by $M^{tr}$;

(v) $I_n$ denotes the identity matrix of order $n$, and $I$ denotes the
identity matrix of order determined by the context in which
the symbol is used; if $Q_1, Q_2, \ldots, Q_n$ are square matrices, then
$Q_1 \oplus Q_2 \oplus \cdots \oplus Q_n$ denotes the direct sum of $Q_1, Q_2, \ldots, Q_n$,
in the order indicated;

(vi) if $D$ is a real diagonal matrix, then $D > 0$ ($D \geq 0$) means that
the diagonal elements of $D$ are positive (nonnegative); and

(vii) we say that a real $n \times n$ matrix $M$ is strongly (weakly)
column-sum dominant if and only if for $j = 1, 2, \ldots, n$
$$ m_{jj} > (\geq) \sum_{i \neq j} |m_{ij}|. $$
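
As an illustration of item (vii) and of the Section 2.6 assumption that some positive diagonal scaling makes a matrix strongly column-sum dominant, the following sketch may help. The function names and the example matrices are hypothetical and chosen only to show the two cases; row scaling by `d` corresponds to forming $DM$.

```python
def column_sum_dominant(m, strong=True):
    """Check m_jj > (>=) sum over i != j of |m_ij| for every column j."""
    n = len(m)
    for j in range(n):
        off = sum(abs(m[i][j]) for i in range(n) if i != j)
        ok = m[j][j] > off if strong else m[j][j] >= off
        if not ok:
            return False
    return True

def scaled(d, m):
    """Return D*M for the diagonal matrix D = diag(d), i.e. scale row i by d[i]."""
    return [[d[i] * x for x in row] for i, row in enumerate(m)]

M = [[3, -1], [-4, 2]]       # hypothetical: not dominant as given
G2 = [[1, -1], [-1, 1]]      # hypothetical: weakly but not strongly dominant

print(column_sum_dominant(M))                      # first column fails: 3 is not > 4
print(column_sum_dominant(scaled([3, 2], M)))      # D = diag(3, 2) repairs both columns
print(column_sum_dominant(G2, strong=False), column_sum_dominant(G2))
```

For `G2` no positive diagonal scaling can help, since the two column conditions would require $d_1 > d_2$ and $d_2 > d_1$ simultaneously.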
Definition 1: The set of all real square matrices $M$ such that every
principal minor of $M$ is nonnegative (positive) is denoted by $P_0$ ($P$).


* In the network case, $A = G$, and it is often possible to determine by inspection
whether or not there exists an $\eta \neq \theta_q$ such that $G\eta = \theta_q$ and $\eta \in S$.

Definition 2: Let $\mathcal{F}_0^{(2p+q)}$ denote that collection of mappings of $E^{(2p+q)}$
into itself defined by: $F(\cdot) \in \mathcal{F}_0^{(2p+q)}$ if and only if there exist for $j =
1, 2, \ldots, (2p + q)$ continuous functions $f_j(\cdot)$ mapping $E^1$ into $E^1$
such that for each $x \in E^{(2p+q)}$, $F(x) = [f_1(x_1), f_2(x_2), \ldots, f_{(2p+q)}(x_{(2p+q)})]^{tr}$
and

(i) $\inf_{\alpha \in (-\infty, \infty)} [f_j(\alpha + \beta) - f_j(\alpha)] = 0$,

(ii) $\sup_{\alpha \in (-\infty, \infty)} [f_j(\alpha + \beta) - f_j(\alpha)] = +\infty$

for all $\beta > 0$ and all $j = 1, 2, \ldots, (2p + q)$.


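Definition 3: Let $\mathcal{T}$ denote the set of all real matrices $M$ such that
$M = M_1 \oplus M_2 \oplus \cdots \oplus M_p \oplus I_q$ with

$$ M_k = \begin{bmatrix} 1 & -\alpha_r^{(k)} \\ -\alpha_f^{(k)} & 1 \end{bmatrix}, $$

$0 \leq \alpha_r^{(k)} < 1$, and $0 \leq \alpha_f^{(k)} < 1$ for all $k = 1, 2, \ldots, p$. As suggested,
if $q = 0$, then $M = M_1 \oplus M_2 \oplus \cdots \oplus M_p$, while if $p = 0$, then $M = I_q$.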

Assumption 1: Throughout Section III, G denotes a real nonnegative- 
definite matrix of order (2p + q). 


A tool that we shall use often is: 


Lemma 1: A real square matrix $M$ is an element of $P_0$ if and only if
$\det(D + M) \neq 0$ for all real diagonal matrices $D > 0$.


Lemma 1 is proved in Ref. 2. 

3.2 Theorem 1: Let $F(\cdot) \in \mathcal{F}_0^{(2p+q)}$ with each $f_j(\cdot)$ continuously differentiable
on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all $\alpha \in (-\infty, \infty)$. Let $T \in \mathcal{T}$, let $C(\cdot)$
[that is, $c + \tau F(\cdot)$], $G$, and $J_u$ be as defined in Section 2.1, and let $\sigma$ be a
real nonnegative constant. Then

$$ \sigma y + TF[C^{-1}(y)] + GC^{-1}(y) = r \tag{13} $$

possesses at most one solution $y$ for each $r \in E^{(2p+q)}$ if and only if

$$ \det(\sigma I + J_u) \neq 0 \quad \text{for all } u \in E^{(2p+q)}, \tag{14} $$

and if $\sigma > 0$ and condition (14) is satisfied then for each $r \in E^{(2p+q)}$ there
exists a solution $y$ of (13).


3.3 Proof of Theorem 1 
We have

$$ \det(\sigma I + J_u) = \det(\sigma I + TF'[g(u)]\{c + \tau F'[g(u)]\}^{-1} + G\{c + \tau F'[g(u)]\}^{-1}) $$
$$ = \det\{c + \tau F'[g(u)]\}^{-1} \cdot \det\{\sigma c + \sigma\tau F'[g(u)] + TF'[g(u)] + G\}, $$

in which $g(\cdot)$ is the mapping of $E^{(2p+q)}$ onto itself defined by $g(u) =
C^{-1}(u)$ for all $u \in E^{(2p+q)}$, and $F'[g(u)] = \mathrm{diag}\{f_j'[g_j(u_j)]\}$. Since
$\det\{c + \tau F'[g(u)]\} > 0$ for all $u$, $\det(\sigma I + J_u) \neq 0$ for all $u$ if and only if

$$ \det\{(\sigma\tau + T)F'[g(u)] + (\sigma c + G)\} \neq 0 \quad \text{for all } u. $$

For each $j$, $g_j(\cdot)$ maps $E^1$ onto $E^1$, and since $F(\cdot) \in \mathcal{F}_0^{(2p+q)}$ with each
$f_j(\cdot)$ continuously differentiable on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all
$\alpha \in (-\infty, \infty)$, the image of $E^1$ under the mapping $f_j'[g_j(\cdot)]$ is $(0, \infty)$†
for all $j$. Thus, by Lemma 1 [since $\det(\sigma\tau + T) \neq 0$], $(\sigma\tau + T)^{-1}(\sigma c +
G) \in P_0$ if and only if

$$ \det(\sigma I + J_u) \neq 0 \quad \text{for all } u. \tag{15} $$

The equation

$$ \sigma y + TF[C^{-1}(y)] + GC^{-1}(y) = r $$

possesses a solution $y$ if and only if $x = C^{-1}(y)$ satisfies

$$ \sigma C(x) + TF(x) + Gx = r, $$

that is, if and only if

$$ (\sigma\tau + T)F(x) + (\sigma c + G)x = r. \tag{16} $$

But equation (16) possesses at most one solution for each $r \in E^{(2p+q)}$
if and only if $(\sigma\tau + T)^{-1}(\sigma c + G) \in P_0$ (see pp. 105-107 of Ref. 3) and
hence if and only if condition (15) is met.

Suppose now that $\sigma > 0$. Since $G$ is nonnegative definite, $\det(\sigma c +
G) \neq 0$. If condition (15) is satisfied then $(\sigma\tau + T)^{-1}(\sigma c + G) \in P_0$
and hence for each $r \in E^{(2p+q)}$, equation (16) possesses a solution $x$
(see p. 99 of Ref. 3). □


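3.4 Theorem 2: Let $T \in \mathcal{T}$, and let $F(\cdot) \in \mathcal{F}_0^{(2p+q)}$ with each $f_j(\cdot)$ continuously
differentiable on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all $\alpha \in (-\infty, \infty)$. Then for
each $\sigma \geq 0$ there exists a positive constant $\epsilon$ such that $\det(\sigma I + J_u) \geq \epsilon$
for all $u \in E^{(2p+q)}$ if and only if $(\sigma\tau + T)^{-1}(\sigma c + G) \in P$.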


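† For any $\beta > 0$ and any $\alpha \in (-\infty, \infty)$, $f_j(\alpha + \beta) - f_j(\alpha) = \beta f_j'(\delta)$ for some
$\delta \in [\alpha, \alpha + \beta]$.

3.5 Proof of Theorem 2

We have

$$ \det(\sigma I + J_u) = \det(\sigma I + TF'[g(u)]\{c + \tau F'[g(u)]\}^{-1} + G\{c + \tau F'[g(u)]\}^{-1}) $$
$$ = \det\{c + \tau F'[g(u)]\}^{-1} \cdot \det\{(\sigma\tau + T)F'[g(u)] + (\sigma c + G)\} $$
$$ = \frac{\det(\sigma\tau + T)\,\det\{F'[g(u)] + A\}}{\prod_{j=1}^{(2p+q)} (c_j + \tau_j f_j'[g_j(u_j)])} \tag{17} $$

in which $A = (\sigma\tau + T)^{-1}(\sigma c + G)$.

For each sequence $\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}$ with each $\epsilon_j$ either zero or unity
and $\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}$ not the sequence $1, 1, \ldots, 1$, let
$m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}}$ denote the determinant obtained from $A$ by deleting rows
$p_1, p_2, \ldots, p_t$ and columns $p_1, p_2, \ldots, p_t$ in which $\{p_1, p_2, \ldots, p_t\} =
\{j : \epsilon_j = 1\}$. Thus for each sequence $\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}$ other than the
sequence $1, 1, \ldots, 1$, $m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}}$ is a principal minor of $A$. Let
$m_{1, 1, \ldots, 1} = 1$, and let $d_j = f_j'[g_j(u_j)]$ for all $j$. Then by a standard
expression for the determinant of the sum of two matrices

$$ \det(F'[g(u)] + A) = {\sum}' d_1^{\epsilon_1} d_2^{\epsilon_2} \cdots d_{(2p+q)}^{\epsilon_{(2p+q)}}\, m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}} $$

in which $\sum'$ denotes a summation over all $2^{(2p+q)}$ sequences $\epsilon_1, \epsilon_2, \ldots,
\epsilon_{(2p+q)}$ and $d_j^0 = 1$ for all $j$. It is clear that

$$ \prod_{j=1}^{(2p+q)} (c_j + \tau_j f_j'[g_j(u_j)]) = {\sum}' d_1^{\epsilon_1} d_2^{\epsilon_2} \cdots d_{(2p+q)}^{\epsilon_{(2p+q)}}\, c_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}} $$

in which each $c_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}}$ is a positive constant. Thus with $\eta =
\det(\sigma\tau + T)$,

$$ \eta^{-1} \det(\sigma I + J_u) = \frac{\sum' d_1^{\epsilon_1} d_2^{\epsilon_2} \cdots d_{(2p+q)}^{\epsilon_{(2p+q)}}\, m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}}}{\sum' d_1^{\epsilon_1} d_2^{\epsilon_2} \cdots d_{(2p+q)}^{\epsilon_{(2p+q)}}\, c_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}}}. \tag{18} $$

Suppose that all principal minors of $A$ are positive. Then there is a
positive constant $\delta$ such that

$$ m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}} \geq \delta\, c_{\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}} $$

for all $\epsilon_1, \epsilon_2, \ldots, \epsilon_{(2p+q)}$ and hence (since $d_j > 0$ for all $j$) $\det(\sigma I +
J_u) \geq \eta\delta$ for all $u \in E^{(2p+q)}$.

As in the proof of Theorem 1, the range of each $d_j = f_j'[g_j(u_j)]$ is $(0, \infty)$,
and for any positive constants $\rho_1, \rho_2, \ldots, \rho_{(2p+q)}$ there exists a $u \in
E^{(2p+q)}$ such that $d_j = \rho_j$ for all $j$. If $A \notin P$ then at least one principal
minor of $A$ is not positive. If $A \notin P_0$, then $\det(F'[g(u)] + A) = 0$
for some $u$. Therefore to complete the proof it is sufficient to show that
if $A \in P_0$ but $A \notin P$ then there is no constant $\epsilon > 0$ such that $\det(\sigma I +
J_u) \geq \epsilon$ for all $u$.

With $A \in P_0$ and $A \notin P$, for at least one sequence $\epsilon_1', \epsilon_2', \ldots, \epsilon_{(2p+q)}'$

$$ m_{\epsilon_1', \epsilon_2', \ldots, \epsilon_{(2p+q)}'} = 0. $$

If $\det A = m_{0, 0, \ldots, 0} = 0$ we have

$$ \inf_{u \in E^{(2p+q)}} \det(\sigma I + J_u) = 0 $$

since $\det(\sigma I + J_u) \to 0$ as $d_j \to 0$ for all $j$. Suppose now that $\det A > 0$
and that $m_{\epsilon_1', \epsilon_2', \ldots, \epsilon_{(2p+q)}'} = 0$ for some sequence $\epsilon_1', \epsilon_2', \ldots, \epsilon_{(2p+q)}'$.
Then with $d_j = d$ for all $j$ for which $\epsilon_j' = 1$ and $d_j = d^{-1}$ for all $j$ for which
$\epsilon_j' = 0$, we have [see equation (18)] $\det(\sigma I + J_u) \to 0$ as $d \to \infty$. □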


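3.6 Theorem 3: Let $T \in \mathcal{T}$, let $F(\cdot) \in \mathcal{F}_0^{(2p+q)}$ with each $f_j(\cdot)$ continuously
differentiable on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all $\alpha \in (-\infty, \infty)$, and let
$\mathcal{J}$ denote $[0, \infty)$ or an interval contained in $[0, \infty)$. Then for all but at most
a finite number of points $\sigma$ contained in $\mathcal{J}$, there is a real constant $\epsilon_\sigma > 0$
such that $\det(\sigma I + J_u) \geq \epsilon_\sigma$ for all $u \in E^{(2p+q)}$ if and only if $\det(\sigma I + J_u) \neq
0$ for all $\sigma \in \mathcal{J}$ and all $u \in E^{(2p+q)}$.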


3.7 Proof of Theorem 3 


As in the proof of Theorem 1, $(\sigma\tau + T)^{-1}(\sigma c + G) \in P_0$ for all $\sigma \in \mathcal{J}$
if and only if $\det(\sigma I + J_u) \neq 0$ for all $\sigma \in \mathcal{J}$ and all $u$. We shall also use
the fact that since $\det(\sigma\tau + T) > 0$ for all $\sigma \geq 0$, each principal minor
of $(\sigma\tau + T)^{-1}(\sigma c + G)$ is a finite-valued rational function of $\sigma$ for all
$\sigma \geq 0$.

(if) If $\det(\sigma I + J_u) \neq 0$ for all $u$ and all $\sigma \in \mathcal{J}$, then $(\sigma\tau + T)^{-1}(\sigma c +
G) \in P_0$ for all $\sigma \in \mathcal{J}$. It is clear that $(\sigma\tau + T)^{-1}(\sigma c + G) \in P$ for all
sufficiently large $\sigma > 0$. Thus each principal minor of $(\sigma\tau + T)^{-1}(\sigma c + G)$
is nonnegative for all $\sigma \in \mathcal{J}$ and is positive for all sufficiently large $\sigma > 0$.
They are therefore positive for all but at most a finite number of values
of $\sigma \in \mathcal{J}$. Thus, by Theorem 2, if $\det(\sigma I + J_u) \neq 0$ for all $\sigma \in \mathcal{J}$ and all $u$
there exists for all but at most a finite number of points $\sigma \in \mathcal{J}$ a positive
constant $\epsilon_\sigma$ such that $\det(\sigma I + J_u) \geq \epsilon_\sigma$ for all $u$.

(only if) If $\det(\sigma I + J_u) = 0$ for some $\sigma \in \mathcal{J}$ and some $u$, then, for
that $\sigma$, $(\sigma\tau + T)^{-1}(\sigma c + G) \notin P_0$. That is, for that $\sigma$ at least one principal
minor of $(\sigma\tau + T)^{-1}(\sigma c + G)$ is negative. This means that $(\sigma\tau + T)^{-1}
(\sigma c + G) \notin P_0$ for all $\sigma$ contained in some interval $\mathcal{J}' \subset \mathcal{J}$, and by Theorem
2, for all $\sigma \in \mathcal{J}'$ there is no $\epsilon_\sigma > 0$ such that $\det(\sigma I + J_u) \geq \epsilon_\sigma$ for all $u$. □

3.8 Theorem 4: Let $R(\cdot)$ be a continuously differentiable mapping of
$E^n$ into $E^n$, and let $J(R)_q$ denote the Jacobian matrix of $R(\cdot)$ at an arbitrary
point $q \in E^n$. If the elements of $J(R)_q$ are bounded on $E^n$, and if there exist
real constants $a > 0$ and $b \geq 0$ such that $\det J(R)_q \geq (a + b\|q\|)^{-1}$ for
all $q \in E^n$, then $R(\cdot)$ is a homeomorphism of $E^n$ onto $E^n$.


3.9 Proof of Theorem 4


In Ref. 16 Meyer proves that $R(\cdot)$ is a homeomorphism of $E^n$ onto
$E^n$ if $J(R)_q^{-1}$ exists for all $q \in E^n$ and there exist real constants $\alpha > 0$
and $\beta > 0$ such that $\|J(R)_q^{-1}\| \leq \alpha + \beta\|q\|$ for all $q \in E^n$.†

With $q$ an arbitrary element of $E^n$, let $\lambda_1, \lambda_2, \ldots, \lambda_n$ denote the
eigenvalues of $J(R)_q^{tr} J(R)_q$, and let $\lambda_1 = \min_j\{\lambda_j\}$. Then $\lambda_1 \lambda_2 \cdots \lambda_n =
[\det J(R)_q]^2 \geq (a + b\|q\|)^{-2}$, and since the elements of $J(R)_q$ are
bounded on $E^n$, there is a constant $\lambda > 0$ such that $\lambda_j \leq \lambda$ for all $j$ and
all $q \in E^n$. Thus

$$ (\lambda_1)^{1/2} \geq \lambda^{-(n-1)/2} (a + b\|q\|)^{-1} \tag{19} $$

for all $q$. For any $x \in E^n$ and any $q \in E^n$, $x^{tr} J(R)_q^{tr} J(R)_q x \geq \lambda_1 x^{tr} x$; that is,

$$ \|J(R)_q x\| \geq \lambda^{-(n-1)/2} (a + b\|q\|)^{-1} \|x\|. $$

With $x = J(R)_q^{-1} y$ in which $y$ is an arbitrary element of $E^n$, we have

$$ \|J(R)_q^{-1} y\| \leq \lambda^{(n-1)/2} (a + b\|q\|)\,\|y\|, $$

which shows that our hypothesis concerning $\det J(R)_q$ ensures that
Meyer's condition on $\|J(R)_q^{-1}\|$ is satisfied. □

3.10 Some Further Definitions 


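Definition 4: For each $T \in \mathcal{T}$, let $\mathcal{T}(T)$ denote the set of all matrices $M$
such that $M = M_1 \oplus M_2 \oplus \cdots \oplus M_p \oplus I_q$ with

$$ M_k = \begin{bmatrix} 1 & -\delta_r^{(k)} \\ -\delta_f^{(k)} & 1 \end{bmatrix} $$

and

$0 < \delta_r^{(k)} \leq \alpha_r^{(k)}$ if $\alpha_r^{(k)} > 0$ and $\delta_r^{(k)} = 0$ if $\alpha_r^{(k)} = 0$,
$0 < \delta_f^{(k)} \leq \alpha_f^{(k)}$ if $\alpha_f^{(k)} > 0$ and $\delta_f^{(k)} = 0$ if $\alpha_f^{(k)} = 0$,

for all $k = 1, 2, \ldots, p$. As suggested, if $q = 0$, then $M = M_1 \oplus M_2 \oplus
\cdots \oplus M_p$, while if $p = 0$, then $M = I_q$.

† Meyer's result is a generalization of a well-known result of Hadamard.
Hadamard proved that $R(\cdot)$ is a homeomorphism if $J(R)_q^{-1}$ exists for all $q \in E^n$ and
satisfies $\|J(R)_q^{-1}\| \leq \alpha$ for all $q \in E^n$ for some positive constant $\alpha$.

Definition 5: For each $T \in \mathcal{T}$, let $\mathcal{T}_0(T)$ denote the set of all $2^{2p}$ matrices
$M$ such that $M = M_1 \oplus M_2 \oplus \cdots \oplus M_p \oplus I_q$ with

$$ M_k = \begin{bmatrix} 1 & -\delta_r^{(k)} \\ -\delta_f^{(k)} & 1 \end{bmatrix} $$

and

$\delta_r^{(k)} = \alpha_r^{(k)}$ or $\delta_r^{(k)} = 0$,
$\delta_f^{(k)} = \alpha_f^{(k)}$ or $\delta_f^{(k)} = 0$

for all $k = 1, 2, \ldots, p$. As suggested, if $q = 0$, then $M = M_1 \oplus M_2 \oplus
\cdots \oplus M_p$, while if $p = 0$, then $M = I_q$.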


Definition 6: Let $\Omega_{(2p+q)}$ denote the family of all $2^{(2p+q)} - 1$ sets $w =
\{i_1, i_2, \ldots, i_r\}$, including the null set, such that $r < (2p + q)$ and
$w \subset \{1, 2, \ldots, (2p + q)\}$.


Definition 7: For $M$ an arbitrary square matrix of order $(2p + q)$, and
for each $w \in \Omega_{(2p+q)}$, let $M_w$ denote the principal submatrix obtained
from $M$ by deleting rows $i_1, i_2, \ldots, i_r$ and columns $i_1, i_2, \ldots, i_r$.
(If $w$ is the null set, then $M_w = M$.)


Definition 8: For each $j \in \{1, 2, \ldots, (2p + q)\}$, let $U_j$ denote the
$(2p + q)$-column-vector with unity in the $j$th position and zeros in all
other positions.


Definition 9: For each $T \in \mathcal{T}$ and each $w \in \Omega_{(2p+q)}$, let $T^w$ denote the
matrix obtained from $T$ by replacing the $j$th column of $T$ with $U_j$ for
all $j \in w$.


3.11 Theorem 5: Let $T \in \mathcal{T}$. Then the following statements are equivalent.


(i) $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$.
(ii) $(D_1 + T)^{-1}(D_2 + G) \in P_0$ for all diagonal $D_1 \geq 0$ and all diagonal
$D_2 \geq 0$.
(iii) $T^{-1}(G + D) \in P_0$ for all diagonal $D \geq 0$.
(iv) $(D_1 + T)^{-1}(D_2 + G) \in P_0$ for all diagonal $D_1 > 0$ and all diagonal
$D_2 > 0$.
(v) $T^{-1}(G + D) \in P_0$ for all diagonal $D > 0$.
(vi) $(T_w)^{-1}G_w \in P_0$ for all $w \in \Omega_{(2p+q)}$.
(vii) $[(T^w)^{-1}G]_w \in P_0$ for all $w \in \Omega_{(2p+q)}$.
(viii) $M^{-1}G \in P_0$ for all $M \in \mathcal{T}_0(T)$.

3.12 Proof of Theorem 5

[(i) and (ii) are equivalent]

By Lemma 1, $(D_1 + T)^{-1}(D_2 + G) \in P_0$ if and only if $\det[(D_1 + T)^{-1}
(D_2 + G) + D] \neq 0$ for all diagonal $D > 0$. Thus $(D_1 + T)^{-1}(D_2 + G)
\in P_0$ for all $D_1 \geq 0$ and all $D_2 \geq 0$ if and only if

$$ \det[(D_2 D^{-1} + D_1 + T)D + G] \neq 0 $$

for all $D_1 \geq 0$, all $D_2 \geq 0$, and all $D > 0$, and hence if and only if

$$ \det[(\Delta + T)D + G] \neq 0 $$

for all diagonal $\Delta \geq 0$ and $D > 0$. Let $T_\Delta = (\Delta + T)(I + \Delta)^{-1}$. Then
$(D_1 + T)^{-1}(D_2 + G) \in P_0$ for all $D_1 \geq 0$ and all $D_2 \geq 0$ if and only if

$$ \det[T_\Delta (I + \Delta)D + G] \neq 0 $$

for all $\Delta \geq 0$ and all $D > 0$, and hence if and only if $\det(T_\Delta D + G) \neq 0$
for all diagonal $D > 0$ and all $\Delta \geq 0$. By Lemma 1, this means that
$T_\Delta^{-1}G \in P_0$ for all $\Delta \geq 0$ if and only if $(D_1 + T)^{-1}(D_2 + G) \in P_0$ for all
$D_1 \geq 0$ and all $D_2 \geq 0$. We observe that $T_\Delta = (T_\Delta)_1 \oplus (T_\Delta)_2 \oplus \cdots \oplus
(T_\Delta)_p \oplus I_q$ in which, with $\Delta = \mathrm{diag}(\Delta_1, \Delta_2, \ldots, \Delta_{(2p+q)})$,

$$ (T_\Delta)_k = \begin{bmatrix} 1 & -\dfrac{\alpha_r^{(k)}}{1 + \Delta_{2k}} \\ -\dfrac{\alpha_f^{(k)}}{1 + \Delta_{2k-1}} & 1 \end{bmatrix} $$

for $k = 1, 2, \ldots, p$. Thus for each $\Delta \geq 0$, $T_\Delta \in \mathcal{T}(T)$; and if $M$ is an
arbitrary element of $\mathcal{T}(T)$, there is a $\Delta \geq 0$ such that $M = T_\Delta$. Therefore
$(D_1 + T)^{-1}(D_2 + G) \in P_0$ for all $D_1 \geq 0$ and all $D_2 \geq 0$ if and only
if $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$.


[(i) and (iii) are equivalent]


Repeat the proof of "(i) is equivalent to (ii)" with each statement
that $D_1 \geq 0$ replaced with $D_1 = \mathrm{diag}(0, 0, \ldots, 0)$.


[(ii) and (iv) are equivalent and (iii) and (v) are equivalent]


Suppose that (ii) and (iv) are not equivalent. Then $(D_1 + T)^{-1}(D_2 +
G) \in P_0$ for all $D_1 > 0$ and all $D_2 > 0$, and for some $D_1^* \geq 0$ and some
$D_2^* \geq 0$, with at least one of $D_1^*$ and $D_2^*$ not positive, $(D_1^* +
T)^{-1}(D_2^* + G) \notin P_0$. Thus some principal minor of $(D_1^* + T)^{-1}(D_2^* + G)$,
and hence of $(D_1^* + T)^{-1}(D_2^* + G) \det(D_1^* + T)$, is negative. Let
$m(D_1^*, D_2^*)$ be some negative principal minor of $(D_1^* + T)^{-1}(D_2^* +
G) \det(D_1^* + T)$, and let $m(D_1^* + \epsilon I, D_2^* + \epsilon I)$ be the corresponding
principal minor of $(D_1^* + \epsilon I + T)^{-1}(D_2^* + \epsilon I + G) \det(D_1^* + \epsilon I + T)$
for all real $\epsilon \geq 0$. Thus $m(D_1^* + \epsilon I, D_2^* + \epsilon I)$ is a polynomial $p(\epsilon)$ in $\epsilon$
for $\epsilon \geq 0$, and $p(\epsilon) \geq 0$ for all $\epsilon > 0$. Therefore $p(0) \geq 0$, which
contradicts $m(D_1^*, D_2^*) < 0$.

A proof that (iii) and (v) are equivalent can be obtained by modifying
the previous paragraph in an obvious manner.


[(vi) is equivalent to (v)]


By Lemma 1, $T^{-1}(G + D) \in P_0$ for all diagonal $D > 0$ if and only
if $\det[T^{-1}(G + D) + D^*] \neq 0$ for all diagonal $D^* > 0$ and $D > 0$, and
hence if and only if $\det(G + TD^* + D) \neq 0$ for all $D^* > 0$ and all
$D > 0$. Therefore, by Lemma 1, $T^{-1}(G + D) \in P_0$ for all $D > 0$ if and
only if $(G + TD^*) \in P_0$ for all $D^* > 0$, that is, if and only if $\det[G_w +
(TD^*)_w] \geq 0$ for all $w \in \Omega_{(2p+q)}$ and all $D^* > 0$. Since $(TD^*)_w = T_w D_w^*$,
we see that $T^{-1}(G + D) \in P_0$ for all $D > 0$ if and only if

$$ \det[(T_w)^{-1}G_w + D_w^*] \geq 0 \quad \text{for all } w \in \Omega_{(2p+q)} \text{ and all } D^* > 0. \tag{20} $$

But, by Lemma 2 (which follows) condition (20) is equivalent to the
condition that $\det[(T_w)^{-1}G_w + D_w^*] > 0$, and hence that $\det[(T_w)^{-1}G_w
+ D_w^*] \neq 0$, for all $w \in \Omega_{(2p+q)}$ and all $D^* > 0$. Thus by Lemma 1, $T^{-1}(G
+ D) \in P_0$ for all $D > 0$ if and only if $(T_w)^{-1}G_w \in P_0$ for all $w \in \Omega_{(2p+q)}$.


Lemma 2: If $A$ is a real square matrix of order $n$ such that $\det(D + A) = 0$
for some diagonal $D > 0$, then $\det(D + A) < 0$ for some diagonal $D > 0$.


Proof: Using the notation of the proof of Theorem 2,

$$ \det(D + A) = {\sum}' d_1^{\epsilon_1} d_2^{\epsilon_2} \cdots d_n^{\epsilon_n}\, m_{\epsilon_1, \epsilon_2, \ldots, \epsilon_n} \tag{21} $$

for all $D > 0$. Since $m_{1, 1, \ldots, 1} = 1$, if $\det(D + A) = 0$ for some $D > 0$,
then for at least one sequence $\epsilon_1', \epsilon_2', \ldots, \epsilon_n'$ we have $m_{\epsilon_1', \epsilon_2', \ldots, \epsilon_n'} < 0$.
If $m_{0, 0, \ldots, 0} = \det A < 0$, then there exists a positive constant $\sigma_1$ such
that $\det(D + A) < 0$ whenever $0 < d_j < \sigma_1$ for all $j$. If $\det A \geq 0$,
then, with $d_j = d$ for all $j$ such that $\epsilon_j' = 1$ and $d_j = d^{-1}$ for all $j$ such
that $\epsilon_j' = 0$, there exists a positive constant $\sigma_2$ such that $\det(D + A) < 0$
for all $d > \sigma_2$ [see (21)]. □
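
The expansion (21) is easy to check on a small example. The sketch below is illustrative only: the matrix `a`, the diagonal `d`, and the function names are hypothetical, and the determinant is computed by cofactor expansion, which is adequate only for small orders. Each 0/1 sequence selects the indices that contribute a factor $d_j$; the remaining indices form the principal minor of $A$.

```python
from itertools import product

def det(m):
    """Determinant by cofactor expansion along the first row (det of 0x0 is 1)."""
    if not m:
        return 1
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def expansion(a, d):
    """Right-hand side of (21): sum over all 0/1 sequences eps of
    prod_j d_j**eps_j times the principal minor of a left after deleting
    the rows and columns j with eps_j = 1."""
    n = len(a)
    total = 0
    for eps in product((0, 1), repeat=n):
        keep = [j for j in range(n) if eps[j] == 0]
        minor = det([[a[i][j] for j in keep] for i in keep])
        weight = 1
        for j in range(n):
            if eps[j]:
                weight *= d[j]
        total += weight * minor
    return total

a = [[2, -1, 0], [1, 3, 4], [0, -2, 1]]   # hypothetical matrix A
d = [5, 7, 11]                             # hypothetical diagonal of D
direct = det([[a[i][j] + (d[i] if i == j else 0) for j in range(3)] for i in range(3)])
print(direct == expansion(a, d))
```

The all-ones sequence contributes $d_1 d_2 \cdots d_n$ with coefficient 1, and the all-zeros sequence contributes $\det A$, which is why the two limiting arguments in the proof of Lemma 2 isolate those terms.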
[(vi) and (vii) are equivalent]

We shall prove that

$$ [(T^w)^{-1}G]_w = (T_w)^{-1}G_w \quad \text{for all } w \in \Omega_{(2p+q)}. \tag{22} $$

Obviously the equality of (22) is satisfied if $w$ is the null set.

It is convenient to introduce the following notation. Let $u$ denote the
$1 \times 1$ matrix containing the entry 1. Let $\varphi$ denote what might be called
the empty matrix, a matrix with no rows or columns; by this we mean
that $\varphi$ is to be interpreted in the following manner: $\varphi \oplus \varphi = \varphi$, $I_s = \varphi$
when $s = 0$, $\varphi^{-1} = \varphi$, and if $M_1$ and $M_2$ are any two (ordinary) matrices,
then $\varphi \oplus M_1 = M_1$, $M_1 \oplus \varphi = M_1$, and $M_1 \oplus \varphi \oplus M_2 = M_1 \oplus M_2$.

Let $w \in \Omega_{(2p+q)}$ and let $w$ not be the null set. The matrix $T$ can be
written as the direct sum $T_1 \oplus T_2 \oplus \cdots \oplus T_p \oplus I_q$. In terms of $u$ and $\varphi$,
$T_w = t_1 \oplus t_2 \oplus \cdots \oplus t_p \oplus I_s$, in which $s = q - t$ where $t$ is the
number of elements contained in the intersection of the sets $w$ and $\{2p + 1,
2p + 2, \ldots, 2p + q\}$, and for $k = 1, 2, \ldots, p$: $t_k = T_k$ if both $(2k - 1)$
and $2k$ are not elements of $w$, $t_k = \varphi$ if both $(2k - 1)$ and $2k$ are elements
of $w$, and $t_k = u$ if either $(2k - 1) \in w$ and $2k \notin w$ or $(2k - 1) \notin w$ and
$2k \in w$. Thus $(T_w)^{-1} = t_1^{-1} \oplus t_2^{-1} \oplus \cdots \oplus t_p^{-1} \oplus I_s$. But $(T^w)^{-1} =
\hat{T}_1^{-1} \oplus \hat{T}_2^{-1} \oplus \cdots \oplus \hat{T}_p^{-1} \oplus I_q$, in which for $k = 1, 2, \ldots, p$: $\hat{T}_k = T_k$
if both $(2k - 1)$ and $2k$ are not elements of $w$,

$$ \hat{T}_k = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} $$

if both $(2k - 1)$ and $2k$ are contained in $w$,

$$ \hat{T}_k = \begin{bmatrix} 1 & -\alpha_r^{(k)} \\ 0 & 1 \end{bmatrix} $$

if $(2k - 1) \in w$ and $2k \notin w$, and

$$ \hat{T}_k = \begin{bmatrix} 1 & 0 \\ -\alpha_f^{(k)} & 1 \end{bmatrix} $$

if $(2k - 1) \notin w$ and $2k \in w$. Thus we see that $[(T^w)^{-1}]_w = (T_w)^{-1}$. Let
${}_{(w)}(T^w)^{-1}$ denote the $(2p + q - r) \times (2p + q)$ matrix obtained from
$(T^w)^{-1}$ by deleting rows $i_1, i_2, \ldots, i_r$. But all elements of columns
$i_1, i_2, \ldots, i_r$ of ${}_{(w)}(T^w)^{-1}$ are zeros, and hence, with $G_{(w)}$ the matrix
obtained from $G$ by deleting columns $i_1, i_2, \ldots, i_r$,

$$ [(T^w)^{-1}G]_w = {}_{(w)}(T^w)^{-1}\, G_{(w)} = [(T^w)^{-1}]_w G_w = (T_w)^{-1}G_w. $$
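
Identity (22) can be spot-checked with exact arithmetic. The sketch below assumes $p = 1$, $q = 1$, hypothetical values for the alphas and for `G`, and uses zero-based index sets `w`; the helper names are illustrative and not taken from the paper. `replace_columns` builds $T^w$ as in Definition 9 and `principal_sub` forms the principal submatrix of Definition 7.

```python
from fractions import Fraction
from itertools import combinations

def inverse(m):
    """Exact inverse by Gauss-Jordan elimination; assumes m is nonsingular."""
    n = len(m)
    a = [[Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
         for i, row in enumerate(m)]
    for c in range(n):
        p = next(r for r in range(c, n) if a[r][c] != 0)
        a[c], a[p] = a[p], a[c]
        piv = a[c][c]
        a[c] = [x / piv for x in a[c]]
        for r in range(n):
            if r != c:
                f = a[r][c]
                a[r] = [x - f * y for x, y in zip(a[r], a[c])]
    return [row[n:] for row in a]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def principal_sub(m, w):
    """Delete the rows and columns whose (zero-based) indices lie in w."""
    keep = [i for i in range(len(m)) if i not in w]
    return [[m[i][j] for j in keep] for i in keep]

def replace_columns(t, w):
    """Replace column j of t by the j-th unit vector for every j in w."""
    n = len(t)
    return [[(1 if i == j else 0) if j in w else t[i][j] for j in range(n)]
            for i in range(n)]

T = [[1, -Fraction(1, 2), 0], [-Fraction(9, 10), 1, 0], [0, 0, 1]]  # hypothetical alphas
G = [[3, -1, -1], [-1, 3, -1], [-1, -1, 3]]                          # hypothetical G

checks = []
for r in range(1, 3):                      # every nonempty w with r < 2p + q = 3
    for w in combinations(range(3), r):
        lhs = principal_sub(matmul(inverse(replace_columns(T, w)), G), w)
        rhs = matmul(inverse(principal_sub(T, w)), principal_sub(G, w))
        checks.append(lhs == rhs)
print(all(checks))
```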
[(viii) and (i) are equivalent]


If $M^{-1}G \in P_0$ for all $M \in \mathcal{T}_0(T)$, then $[(T^w)^{-1}G]_w \in P_0$ for all $w \in \Omega_{(2p+q)}$.
Thus, statement (viii) implies statement (vii). Since we have proved
that (vii) is equivalent to (i), it suffices to prove that (i) implies (viii).
Suppose that $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$. Let $M$ be an arbitrary
element of $\mathcal{T}_0(T)$. Then $[M + \delta(T - M)] \in \mathcal{T}(T)$ for all $\delta \in (0, 1]$, and
therefore $[M + \delta(T - M)]^{-1}G \in P_0$ for all $\delta \in (0, 1]$. At this point a
continuity-type argument similar to that used in the proof of [(ii) and
(iv) are equivalent] shows that $M^{-1}G \in P_0$. □


3.13 Corollary 1 (Corollary to Theorem 5):


If $T \in \mathcal{T}$ and $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$, then $\det(\sigma I + J_u) \neq 0$ for all
$\sigma \geq 0$ and all $u \in E^{(2p+q)}$ provided that for all $j$ $f_j(\cdot)$ is continuously
differentiable on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all $\alpha \in (-\infty, \infty)$.


3.14 Proof of Corollary 1


If $T \in \mathcal{T}$ and $M^{-1}G \in P_0$ for all $M \in \mathcal{T}(T)$, then, by the equivalence of
(i) and (ii) of Theorem 5, $(\sigma\tau + T)^{-1}(\sigma c + G) \in P_0$ for all $\sigma \geq 0$. The
first portion of the proof of Theorem 1 shows that if $(\sigma\tau + T)^{-1}(\sigma c +
G) \in P_0$ for all $\sigma \geq 0$ and if for all $j$ $f_j(\cdot)$ is continuously differentiable
on $(-\infty, \infty)$ and $f_j'(\alpha) > 0$ for all $\alpha \in (-\infty, \infty)$, then $\det(\sigma I + J_u) \neq 0$
for all $\sigma \geq 0$ and all $u \in E^{(2p+q)}$.


3.15 Definition 10: For $p > 0$ let $\Omega^{\dagger}_{(2p+q)}$ denote the subset of $\Omega_{(2p+q)}$
containing all sets $\omega$ belonging to $\Omega_{(2p+q)}$ such that $\omega$ is not the null set
and $2k \in \omega$ if and only if $(2k - 1) \in \omega$ for $k = 1, 2, \ldots, p$. For $p = 0$,
let $\Omega^{\dagger}_{(2p+q)}$ denote the family of all sets contained in $\Omega_{(2p+q)}$ with the
exception of the null set.


3.16 Theorem 6: If $T \in \mathcal{T}$ and $T^{-1}G \in P_0$, then $(T_\omega)^{-1}G_\omega \in P_0$ for all
$\omega \in \Omega^{\dagger}_{(2p+q)}$.

3.17 Proof of Theorem 6


Let $T \in \mathcal{T}$, and let $T^{-1}G \in P_0$. By Lemma 1, $\det(TD + G) \neq 0$ (and
hence $\det(TD + G) > 0$) for all diagonal $D > 0$. Let
$\omega = \{i_1, i_2, \ldots, i_r\} \in \Omega^{\dagger}_{(2p+q)}$, and let $d_{i_k} = d$ for $k = 1, 2, \ldots, r$.
It may be the case that $(TD + G)$ is a block matrix of the form

$$
\begin{bmatrix} (TD + G)_\omega & H_{12} \\ H_{21} & (d\hat{T} + H_{22}) \end{bmatrix} \tag{23}
$$

in which $\hat{T}$ is a direct sum of all $2 \times 2$ and $1 \times 1$ block matrices on the
diagonal of $T$ which do not appear in $T_\omega$, and $H_{12}$, $H_{21}$, and $H_{22}$ are
independent of $D$. Clearly $\det \hat{T} > 0$. If $(TD + G)$ is not of the form (23),
then by a sequence of interchanges of rows and corresponding columns of
$(TD + G)$ we obtain a matrix of that form.
Thus, for some $\hat{T}$ of the form indicated above and for the corresponding
constant matrices $H_{12}$, $H_{21}$, and $H_{22}$ whose elements are elements
of $G$,

$$
\det(TD + G) = \det \begin{bmatrix} (TD + G)_\omega & H_{12} \\ H_{21} & (d\hat{T} + H_{22}) \end{bmatrix}
$$

for all $d_j > 0$ for $j \notin \omega$. For all sufficiently large $d > 0$,
$\det(d\hat{T} + H_{22}) > 0$, and then

$$
0 < \det(TD + G) = \det(d\hat{T} + H_{22}) \cdot \det[(TD + G)_\omega - H_{12}(d\hat{T} + H_{22})^{-1}H_{21}]
$$

for all $d_j > 0$ for $j \notin \omega$. Since $H_{12}(d\hat{T} + H_{22})^{-1}H_{21}$ approaches the zero
matrix of order $(2p + q - r)$ as $d \to \infty$, we must have $\det(TD + G)_\omega \ge 0$
for all $d_j > 0$ for $j \notin \omega$. Therefore, since $(TD)_\omega = T_\omega D_\omega$, we must
have $\det(T_\omega D_\omega + G_\omega) \ge 0$ for all $D_\omega > 0$. But this means (see Lemma
2) that $\det(T_\omega D_\omega + G_\omega) \neq 0$ for all $D_\omega > 0$. Thus, by Lemma 1,
$(T_\omega)^{-1}G_\omega \in P_0$. □


3.18 Theorem 7: If $T \in \mathcal{T}$ with $p = 1$ or $p = 2$, and if $T^{-1}G \in P_0$ with
$G$ the short-circuit conductance matrix of a transformerless positive-
element resistance network, then $(T_\omega)^{-1}G_\omega \in P_0$ for all $\omega \in \Omega_{(2p+q)}$.


3.19 Proof of Theorem 7 


Suppose that $T^{-1}G \in P_0$ with $p = 2$. Theorem 6 asserts that
$(T_\omega)^{-1}G_\omega \in P_0$ for all $\omega \in \Omega^{\dagger}_{(4+q)}$. But, aside from the null set, the sets
$\omega = \{i_1, i_2, \ldots, i_r\}$ that are contained in $\Omega_{(4+q)}$ but not in $\Omega^{\dagger}_{(4+q)}$
possess the property that $T_\omega = T_1 \oplus I_{(2+q-r)}$, or $T_\omega = u \oplus T_2 \oplus I_{(1+q-r)}$
where $u$ is the $1 \times 1$ matrix containing the element 1, or
$T_\omega = I_{(4+q-r)}$.

If $T_\omega = I_{(4+q-r)}$, then obviously $(T_\omega)^{-1}G_\omega \in P_0$. If $T_\omega = T_1 \oplus I_{(2+q-r)}$,
then for any $D_\omega = \mathrm{diag}\,[D_2 \oplus D_{(2+q-r)}]$ with $D_2 > 0$ and
$D_{(2+q-r)} > 0$ diagonal matrices of order 2 and $(2 + q - r)$ respectively,

$$
\det(T_\omega D_\omega + G_\omega) = \det \begin{bmatrix} T_1 D_2 + G_{11} & G_{12} \\ G_{21} & D_{(2+q-r)} + G_{22} \end{bmatrix} \tag{24}
$$

in which $G_{11}$, $G_{12}$, $G_{21}$, and $G_{22}$ are the appropriate block matrices
of $G_\omega$. Since $\det[D_{(2+q-r)} + G_{22}] > 0$, we have

$$
\det(T_\omega D_\omega + G_\omega) = \det[D_{(2+q-r)} + G_{22}] \cdot \det\{T_1 D_2 + G_{11} - G_{12}[D_{(2+q-r)} + G_{22}]^{-1}G_{21}\}.
$$

But $G_{11} - G_{12}[D_{(2+q-r)} + G_{22}]^{-1}G_{21}$ is the short-circuit conductance
matrix of a transformerless common-ground 2-port network; it is of the
form

$$
\begin{bmatrix} g_{11} & -g_{12} \\ -g_{12} & g_{22} \end{bmatrix}
$$

with $g_{11} \ge 0$, $g_{22} \ge 0$, $g_{12} \ge 0$, $g_{11} \ge g_{12}$, and $g_{22} \ge g_{12}$. Therefore

$$
\det\{T_1 D_2 + G_{11} - G_{12}[D_{(2+q-r)} + G_{22}]^{-1}G_{21}\} > 0
$$

for all $D_2 > 0$ and all $D_{(2+q-r)} > 0$, $\det(T_\omega D_\omega + G_\omega) \neq 0$ for all $D_\omega > 0$,
and hence, by Lemma 1, $(T_\omega)^{-1}G_\omega \in P_0$. Finally, the case in which
$T_\omega = u \oplus T_2 \oplus I_{(1+q-r)}$ can be treated in a manner similar to that used
to show that $(T_\omega)^{-1}G_\omega \in P_0$ when $T_\omega = T_1 \oplus I_{(2+q-r)}$, since, with $\omega$
such that $T_\omega = u \oplus T_2 \oplus I_{(1+q-r)}$, and with $D$ an arbitrary diagonal
matrix of order $(4 + q - r)$, a sequence of interchanges of rows and
corresponding columns of $(T_\omega D + G_\omega)$ can be performed to obtain a
matrix of the type that appears on the right side of equation (24).
Therefore $(T_\omega)^{-1}G_\omega \in P_0$ for all $\omega \in \Omega_{(2p+q)}$.

When $p = 1$, aside from the null set, the sets $\omega = \{i_1, i_2, \ldots, i_r\}$
that are contained in $\Omega_{(2p+q)}$ but not in $\Omega^{\dagger}_{(2p+q)}$ possess the property
that $T_\omega = I_{(2+q-r)}$, and obviously when $T_\omega = I_{(2+q-r)}$, $(T_\omega)^{-1}G_\omega \in P_0$. □


3.20 Theorem 8: Let $T \in \mathcal{T}$ and let $G$ possess the property that for some
diagonal matrix $D > 0$, both $DT$ and $DG$ are strongly-column-sum dominant.
For each $j = 1, 2, \ldots, (2p + q)$ let $f_j(\cdot)$ be a continuous
monotone-nondecreasing mapping of $E^1$ into itself such that $f_j(0) = 0$, let
$h \in (0, \infty)$, and, with $F(\cdot)$ and $C(\cdot)$ defined relative to the $f_j(\cdot)$ as in
Section 2.1, suppose that the sequences $\{y_n\}$ and $\{w_n\}$ in $E^{(2p+q)}$ satisfy

$$
y_{n+1} + h\{TF[C^{-1}(y_{n+1})] + GC^{-1}(y_{n+1})\} = y_n + w_n
$$

for all $n \ge 0$. Then there exists a positive constant $\delta$ depending only on the
$c_j$, the $\tau_j$, $T$, $G$, and $D$ such that

$$
\text{(i)} \quad \|Dy_n\|_1 \le (1 + \delta h)^{-n}\|Dy_0\|_1 + \sum_{k=1}^{n} (1 + \delta h)^{-k}\|Dw_{(n-k)}\|_1
$$

for all $n \ge 1$, and

$$
\text{(ii)} \quad \|D(y_n - \tilde{y}_n)\|_1 \le (1 + \delta h)^{-n}\|D(y_0 - \tilde{y}_0)\|_1 + \epsilon \sum_{k=0}^{n} (1 + \delta h)^{-k}
$$

for all $n \ge 1$, in which $\{\tilde{y}_n\}$ is any sequence in $E^{(2p+q)}$ with the property
that $\|D(\tilde{y}_n - y^*_n)\|_1 \le \epsilon$ for all $n \ge 1$ with $\epsilon$ a positive constant and the
sequence $\{y^*_n\}$ such that

$$
y^*_{n+1} + h\{TF[C^{-1}(y^*_{n+1})] + GC^{-1}(y^*_{n+1})\} = \tilde{y}_n + w_n
$$

for all $n \ge 0$.


3.21 Proof of Theorem & 


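We shall first prove part (ii). With $D$ such that $DT$ and $DG$ are
strongly-column-sum dominant, we have for all $n \ge 0$

$$
Dy_{n+1} + h\{DTF[C^{-1}(y_{n+1})] + DGC^{-1}(y_{n+1})\} = Dy_n + Dw_n
$$

and

$$
Dy^*_{n+1} + h\{DTF[C^{-1}(y^*_{n+1})] + DGC^{-1}(y^*_{n+1})\} = Dy^*_n + D(\tilde{y}_n - y^*_n) + Dw_n
$$

in which we shall take $y^*_0$ to be $\tilde{y}_0$. As in the proof of Theorem 2 of
Ref. 3, we write

$$
F[C^{-1}(y_{n+1})] - F[C^{-1}(y^*_{n+1})] = \mathrm{diag}\left(\frac{r(n)_j}{c_j + \tau_j r(n)_j}\right)(y_{n+1} - y^*_{n+1}) \tag{25}
$$

and

$$
C^{-1}(y_{n+1}) - C^{-1}(y^*_{n+1}) = \mathrm{diag}\left(\frac{1}{c_j + \tau_j r(n)_j}\right)(y_{n+1} - y^*_{n+1}) \tag{26}
$$

in which $r(n)_j$ depends on the $j$th components of $y_{n+1}$ and $y^*_{n+1}$, and
$r(n)_j \ge 0$ for all $n \ge 0$ and all $j$.
Thus, with $Q = DTD^{-1}$ and $R = DGD^{-1}$,

$$
\left\{I + hQ\,\mathrm{diag}\left(\frac{r(n)_j}{c_j + \tau_j r(n)_j}\right) + hR\,\mathrm{diag}\left(\frac{1}{c_j + \tau_j r(n)_j}\right)\right\} D(y_{n+1} - y^*_{n+1})
= D(y_n - y^*_n) - D(\tilde{y}_n - y^*_n)
$$

for all $n \ge 0$. At this point we shall use the proposition that if $M$ is any
real matrix of order $(2p + q)$ with the property that there exists a
positive constant $\eta$ such that $m_{jj} - \sum_{i \neq j} |m_{ij}| \ge \eta$ for all $j$, then
$\|Mx\|_1 \ge \eta\|x\|_1$ for all $x \in E^{(2p+q)}$. Now let

$$
M = I + hQ\,\mathrm{diag}\left(\frac{r(n)_j}{c_j + \tau_j r(n)_j}\right) + hR\,\mathrm{diag}\left(\frac{1}{c_j + \tau_j r(n)_j}\right)
$$

for arbitrary $n \ge 0$. Then for arbitrary $j$

$$
\begin{aligned}
m_{jj} - \sum_{i \neq j} |m_{ij}|
&= 1 + h q_{jj}\frac{r(n)_j}{c_j + \tau_j r(n)_j} + h r_{jj}\frac{1}{c_j + \tau_j r(n)_j}
 - h\sum_{i \neq j} |q_{ij}| \frac{r(n)_j}{c_j + \tau_j r(n)_j} - h\sum_{i \neq j} |r_{ij}| \frac{1}{c_j + \tau_j r(n)_j} \\
&= 1 + h\frac{r(n)_j}{c_j + \tau_j r(n)_j}\Big(q_{jj} - \sum_{i \neq j} |q_{ij}|\Big)
 + h\frac{1}{c_j + \tau_j r(n)_j}\Big(r_{jj} - \sum_{i \neq j} |r_{ij}|\Big) \\
&\ge 1 + \delta h,
\end{aligned}
$$

in which

$$
\delta = \min\Big\{\min_j \tau_j^{-1}\Big(q_{jj} - \sum_{i \neq j} |q_{ij}|\Big),\ \min_j c_j^{-1}\Big(r_{jj} - \sum_{i \neq j} |r_{ij}|\Big)\Big\}.
$$

Therefore

$$
\begin{aligned}
\|D(y_{n+1} - y^*_{n+1})\|_1
&\le (1 + \delta h)^{-1}\|D(y_n - y^*_n) - D(\tilde{y}_n - y^*_n)\|_1 \\
&\le (1 + \delta h)^{-1}\|D(y_n - y^*_n)\|_1 + (1 + \delta h)^{-1}\|D(\tilde{y}_n - y^*_n)\|_1 \\
&\le (1 + \delta h)^{-1}\|D(y_n - y^*_n)\|_1 + \epsilon(1 + \delta h)^{-1}
\end{aligned}
$$

for all $n \ge 0$, and hence

$$
\|D(y_n - y^*_n)\|_1 \le (1 + \delta h)^{-n}\|D(y_0 - y^*_0)\|_1 + \epsilon\sum_{k=1}^{n} (1 + \delta h)^{-k}
$$

for all $n \ge 1$. Finally, since $\|D(y_n - \tilde{y}_n)\|_1 \le \|D(y_n - y^*_n)\|_1 +
\|D(y^*_n - \tilde{y}_n)\|_1 \le \|D(y_n - y^*_n)\|_1 + \epsilon$, and since $y^*_0 = \tilde{y}_0$,

$$
\|D(y_n - \tilde{y}_n)\|_1 \le (1 + \delta h)^{-n}\|D(y_0 - \tilde{y}_0)\|_1 + \epsilon\sum_{k=0}^{n} (1 + \delta h)^{-k}
$$

for all $n \ge 1$, which completes the proof of part (ii) of the theorem.
The proof of part (i) is similar to that of part (ii). Using

$$
Dy_{n+1} + h\{DTF[C^{-1}(y_{n+1})] + DGC^{-1}(y_{n+1})\} = Dy_n + Dw_n
$$

for all $n \ge 0$, and equations (25) and (26) with $y^*_{n+1} = \theta$ for all $n$, we
find that

$$
\|Dy_{n+1}\|_1 \le (1 + h\delta)^{-1}\|Dy_n\|_1 + (1 + h\delta)^{-1}\|Dw_n\|_1
$$

for all $n \ge 0$. Therefore

$$
\|Dy_n\|_1 \le (1 + h\delta)^{-n}\|Dy_0\|_1 + \sum_{k=1}^{n} (1 + h\delta)^{-k}\|Dw_{(n-k)}\|_1
$$

for all $n \ge 1$. □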
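The proposition used in the proof of Theorem 8 (a real matrix whose columns are diagonally dominant with margin $\eta$ satisfies $\|Mx\|_1 \ge \eta\|x\|_1$) can be spot-checked numerically. The following is a minimal sketch, not part of the original paper; the helper names, the matrix size, and the random test data are all assumptions made for illustration.

```python
import random

def column_margin(M):
    """Smallest value over columns j of m_jj - sum_{i != j} |m_ij|."""
    n = len(M)
    return min(
        M[j][j] - sum(abs(M[i][j]) for i in range(n) if i != j)
        for j in range(n)
    )

def norm1(x):
    """The 1-norm of a vector."""
    return sum(abs(v) for v in x)

def matvec(M, x):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(row[j] * x[j] for j in range(len(x))) for row in M]

def random_column_dominant(n, eta, rng):
    """Random matrix (hypothetical test data) whose every column has margin eta."""
    M = [[rng.uniform(-1.0, 1.0) for _ in range(n)] for _ in range(n)]
    for j in range(n):
        off = sum(abs(M[i][j]) for i in range(n) if i != j)
        M[j][j] = off + eta  # force m_jj - sum_{i != j} |m_ij| = eta
    return M

rng = random.Random(0)
eta = 0.25
worst_ratio = float("inf")
for _ in range(200):
    M = random_column_dominant(5, eta, rng)
    x = [rng.uniform(-1.0, 1.0) for _ in range(5)]
    # The proposition asserts that this ratio never falls below eta.
    worst_ratio = min(worst_ratio, norm1(matvec(M, x)) / norm1(x))
```

In the proof the same bound is applied with $\eta = 1 + \delta h$, which is what yields the contraction factor $(1 + \delta h)^{-1}$ per step.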
3.22 Theorem 9: Let $T \in \mathcal{T}$ and let $G$ possess the property that for some
diagonal matrix $D > 0$, both $DT$ and $DG$ are strongly-column-sum dominant.
Let $B(\cdot)$ denote a real continuously-differentiable $(2p + q)$-vector-
valued function of $t$ for $t \in [0, \infty)$ such that both $B(\cdot)$ and $(d/dt)B(\cdot)$ are
bounded on $[0, \infty)$. With $F(\cdot)$ such that each $f_j(0) = 0$, and with $C(\cdot)$
defined relative to $F(\cdot)$ as in Section 2.1, let $u(\cdot)$ satisfy

$$
\frac{du}{dt} + TF[C^{-1}(u)] + GC^{-1}(u) = B(t), \qquad t \ge 0
$$

and, with $h$ an arbitrary positive constant, let $u_n$ denote $u(nh)$ for all
$n \ge 0$. Let $\{y_n\}$ be a sequence in $E^{(2p+q)}$ such that

$$
y_{n+1} + h\{TF[C^{-1}(y_{n+1})] + GC^{-1}(y_{n+1})\} = y_n + hB[(n + 1)h], \qquad n \ge 0.
$$

Then there exist positive constants $\delta$ and $\rho$, both independent of $h$, such that

$$
\|D(u_n - y_n)\|_1 \le (1 + \delta h)^{-n}\|D(u_0 - y_0)\|_1 + \rho h
$$

for all $n \ge 1$.


3.23 Proof of Theorem 9 


The sequence $\{u_n\}$ satisfies

$$
u_{n+1} + h\{TF[C^{-1}(u_{n+1})] + GC^{-1}(u_{n+1})\} = u_n + hB[(n + 1)h] + \xi_n, \qquad n \ge 0
$$

in which $\xi_n$ is often referred to as "the local-truncation error at step
$n$." We shall first bound $\xi_n$.

Since $B(\cdot)$ is bounded on $[0, \infty)$, and since for some $D > 0$, both
$DT$ and $DG$ are strongly-column-sum dominant, a direct modification
of the proof of Theorem 1 of Ref. 5 shows that $u(\cdot)$ is bounded on
$[0, \infty)$; and hence since

$$
\frac{d^2u}{dt^2} = J_u\{TF[C^{-1}(u)] + GC^{-1}(u)\} - J_u B(t) + \frac{d}{dt}B(t), \qquad t \ge 0 \tag{27}
$$

with $(d/dt)B(\cdot)$ and the elements of the Jacobian matrix $J_u$ bounded,
it is clear that $(d^2u/dt^2)$ is bounded on $[0, \infty)$. By the usual Taylor-
series-type argument we can show that for arbitrary $n \ge 0$, $\xi_n = \tfrac{1}{2}h^2U_n$
in which for each $j$ the $j$th component of $U_n$ is the $j$th component of
$(d^2u/dt^2)$ evaluated at some point contained in the interval
$[nh, (n + 1)h]$. Thus there exists a positive constant $\rho_1$ such that

$$
\|D\xi_n\|_1 \le \tfrac{1}{2}h^2\rho_1 \quad \text{for all } n \ge 0. \tag{28}
$$

Therefore, using (28) and the equations

$$
u_{n+1} + h\{TF[C^{-1}(u_{n+1})] + GC^{-1}(u_{n+1})\} = u_n + hB[(n + 1)h] + \xi_n,
$$

$$
y_{n+1} + h\{TF[C^{-1}(y_{n+1})] + GC^{-1}(y_{n+1})\} = y_n + hB[(n + 1)h], \qquad n \ge 0
$$

by an argument similar to that used in the proof of part (ii) of Theorem
8, and with $\delta$ as defined there, we find that

$$
\|D(u_{n+1} - y_{n+1})\|_1 \le (1 + \delta h)^{-1}\|D(u_n - y_n)\|_1 + (1 + \delta h)^{-1}\tfrac{1}{2}h^2\rho_1
$$

for all $n \ge 0$, and hence that

$$
\begin{aligned}
\|D(u_n - y_n)\|_1
&\le (1 + \delta h)^{-n}\|D(u_0 - y_0)\|_1 + \tfrac{1}{2}h^2\rho_1\sum_{k=1}^{n} (1 + \delta h)^{-k} \\
&\le (1 + \delta h)^{-n}\|D(u_0 - y_0)\|_1 + \tfrac{1}{2}h^2\rho_1\sum_{k=1}^{\infty} (1 + \delta h)^{-k} \\
&\le (1 + \delta h)^{-n}\|D(u_0 - y_0)\|_1 + \tfrac{1}{2}h\delta^{-1}\rho_1
\end{aligned}
$$

for all $n \ge 1$. □
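The first-order error bound of Theorem 9 can be illustrated on a scalar analogue of the implicit scheme. The sketch below is not from the paper: it assumes $C(\cdot)$ is the identity, replaces $TF + G$ by a single hypothetical monotone function `f`, and manufactures the forcing `B` so that $u(t) = \sin t$ is the exact solution. The implicit equation at each step is solved by Newton's method, which is one choice among several.

```python
import math

def f(v):
    # Hypothetical monotone-increasing nonlinearity with f(0) = 0.
    return v + v ** 3

def df(v):
    # Derivative of f, used by Newton's method.
    return 1.0 + 3.0 * v * v

def B(t):
    # Forcing chosen so that u(t) = sin(t) solves du/dt + f(u) = B(t).
    s = math.sin(t)
    return math.cos(t) + s + s ** 3

def implicit_step(y_n, t_next, h):
    """Solve y + h*f(y) = y_n + h*B(t_next) for y by Newton's method."""
    rhs = y_n + h * B(t_next)
    y = y_n
    for _ in range(50):
        g = y + h * f(y) - rhs
        step = g / (1.0 + h * df(y))
        y -= step
        if abs(step) < 1e-14:
            break
    return y

def max_error(h, t_end=10.0):
    """Largest |y_n - u(nh)| over the grid, starting from y_0 = u(0) = 0."""
    n_steps = int(round(t_end / h))
    y, worst = 0.0, 0.0
    for n in range(n_steps):
        y = implicit_step(y, (n + 1) * h, h)
        worst = max(worst, abs(y - math.sin((n + 1) * h)))
    return worst

e_coarse = max_error(0.02)
e_fine = max_error(0.01)
```

Because $u_0 = y_0$ here, the bound reduces to $\rho h$, so halving $h$ should roughly halve the observed error.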


3.24 Definition 11: LettR = Ri @®R2DB--: PR, @ RK, in which 


Ry = diag (r,, t2, °** , 7) With r; 2 Oforj = 1, 2,--- ,q and 
R, _ ri ae ri? ri? 
7? ri? + ri 
with r? = 0,r{” = 0, andr” = Oforallk = 1, 2, --- , p. As suggested, 


ifg=0,thnR=R,OR.@®-:-:-@OR,, whileif p = 0, thenkR = Rk. 
3.25 Theorem 10: Let T 23. If p > 0 and zf R satisfies 
rd Oat ae eer 
aL = of) = 9 
fork = 1, 2, --- , p, then T"GU + RG) « Py whenever TG e Py. 
3.26 Proof of Theorem 10 
By Lemma 1, $T^{-1}G(I + RG)^{-1} \in P_0$ if and only if

$$
\det[T^{-1}G(I + RG)^{-1} + D^*] \neq 0 \tag{29}
$$

for all diagonal $D^* > 0$. But (29) is satisfied if and only if

$$
\det(T^{-1}G + D^*RG + D^*) \neq 0.
$$

Here, since
(k) (k)\-1,,(k) (k) 
a, (l—a,) re” = 
(k) (&)\-1,,(k) (k) 
ay (1 — as ) To = 1, 


for $k = 1, 2, \ldots, p$ we have $R = DT^{-1}$ for some diagonal matrix $D \ge 0$.
Thus (29) is satisfied if and only if

$$
\det[(I + DD^*)T^{-1}G + D^*] \neq 0.
$$

When $T^{-1}G \in P_0$ we have

$$
\det(T^{-1}G + D) \neq 0
$$

for all diagonal $D > 0$. Thus (29) is satisfied for all $D^* > 0$ whenever
$T^{-1}G \in P_0$. □
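Lemma 1 is used above in the form "$A \in P_0$ if and only if $\det(A + D) \neq 0$ for all diagonal $D > 0$." The sketch below illustrates that equivalence on two small matrices. It assumes that $P_0$ denotes the usual class of real square matrices whose principal minors are all nonnegative; the example matrices and helper names are hypothetical and are not taken from the paper.

```python
from itertools import combinations
import random

def det(A):
    """Determinant by Gaussian elimination with partial pivoting."""
    A = [row[:] for row in A]
    n, sign, prod = len(A), 1.0, 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(A[r][c]))
        if A[p][c] == 0.0:
            return 0.0
        if p != c:
            A[c], A[p] = A[p], A[c]
            sign = -sign
        prod *= A[c][c]
        for r in range(c + 1, n):
            m = A[r][c] / A[c][c]
            for k in range(c, n):
                A[r][k] -= m * A[c][k]
    return sign * prod

def is_P0(A, tol=1e-12):
    """True if every principal minor of A is nonnegative (up to tol)."""
    n = len(A)
    for size in range(1, n + 1):
        for idx in combinations(range(n), size):
            sub = [[A[i][j] for j in idx] for i in idx]
            if det(sub) < -tol:
                return False
    return True

def add_diag(A, d):
    """Return A + diag(d)."""
    n = len(A)
    return [[A[i][j] + (d[i] if i == j else 0.0) for j in range(n)] for i in range(n)]

A_good = [[1.0, -2.0], [1.0, 0.0]]  # principal minors 1, 0 and det = 2
A_bad = [[1.0, 2.0], [1.0, 0.0]]    # det = -2, so not in P_0

rng = random.Random(0)
# For a P_0 matrix, det(A + D) stays positive for every sampled diagonal D > 0.
min_det_good = min(
    det(add_diag(A_good, [rng.uniform(0.01, 10.0) for _ in range(2)]))
    for _ in range(1000)
)
# For the non-P_0 matrix, D = I already makes A + D singular.
det_bad_at_identity = det(add_diag(A_bad, [1.0, 1.0]))
```

Random sampling can only fail to find a singular $A + D$; it does not prove membership in $P_0$, which is why `is_P0` enumerates the principal minors directly.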

3.27 Theorem 11: If $M^{-1}G \in P_0$ for all $M \in \mathcal{T}$, then for any $T \in \mathcal{T}$,
$T^{-1}G(I + RG)^{-1} \in P_0$ for all $R$.

3.28 Proof of Theorem 11 


Let $T \in \mathcal{T}$. As in the proof of Theorem 10, $T^{-1}G(I + RG)^{-1} \in P_0$ if
and only if

$$
\det[(T^{-1} + D^*R)G + D^*] \neq 0
$$

for all diagonal $D^* > 0$. It is a simple matter to verify that for each
$D^* > 0$ and each $R$ there exists an $M \in \mathcal{T}$ and a diagonal matrix $D > 0$
such that $(T^{-1} + D^*R) = DM^{-1}$. Since $M^{-1}G \in P_0$ for all $M \in \mathcal{T}$, we
have (by Lemma 1)

$$
\det(DM^{-1}G + D^*) \neq 0
$$

for all $D^* > 0$. □


3.29 Theorem 12: Let Te 3 with p > Oandq = 0. Then M'G ¢ Py for 
all M ¢ 3(T) if and only if T"G(I + RG) « P, for all R such that 


afP(L — af) rf 
a = afr Bri? 
fork = 1,2,---,pandr; 2 0 for allj such that! Sj Sq. 
3.30 Proof of Theorem 12 
As in the proof of Theorem 10, 7-’GU + RG)™ « P, if and only if 


det (T"'G + D*RG + D*) #0 (30) 
for all diagonal $D^* > 0$. The inequalities $r_j \ge 0$ for all $j$ such that
$1 \le j \le q$ and

a (1 = af®)- , 

a (1 = as)“ ® 
fork = 1, 2,---+, pare equivalent to the condition that R = D,T~* + D, 
for some diagonal matrix D, = 0 and some diagonal matrix D, « S, in 
which S is the set of all diagonal matrices D = 0 such that DT is 
symmetric. Hence 7'G(I + RG)~ « P, for all such # if and only if 


(k) 
b 


(k) 


= 
af, 


det {[(J + D,D*)T™ + D*D,|JG + D*} #0 (31) 
for all diagonal D* > 0, D. 2 0, and D, ¢ 8. 
Let A = diag (Ar, Ae, °** » Acapegy) be such that 
D, = D** A“ AC + D,D*) 
in which 
KS Aiap (0)5.16) foes Oy os) OL 
ifg > 0, A = diag (6,, 6,, 62, 62, °°: , 6), 6») if g = 0, and 


jee be aa? ter es 4p: 
The left side of (81) is 
det [IJ + D,D*)(T™' + AVA)G + D*] 


which can be written as 


det [I + D,D*) A*(I + A) Ay T7°G + D*| (32) 
with 
Ty’ = Ay AU + A(T + AA) 
and 
A, = diag (41, 61, 53, 62, --- , 53, 65) OI, 
ifg > Oand Ay = diag (67, 6{, 63, 68, --- , 6%, 64) if g = 0, in which 
fork = 1,.2) 86% 3p 


bf = 1 = apa? (1 + Neer) (1 + An). 


But (32) vanishes if and only if det (7;'G@ + D) vanishes, in which 
D = Ay’ + A) AU + D,D*)"D*. We observe that D is a positive 
diagonal matrix and that given any diagonal D’ > O and given any 
A = Owe can choose D* > 0 and D,¢ 8 so that D =D’. Thus T'G(I. + 
RG) « Py for all R = (D,T™* + D,) with D, « S and D, = 0 if and 
only if 

det (T;'G + D) #0 
for all A = O and D > O, that is, if and only if 7;’G e P, for all A = 0 
(see Lemma 1 of Section 3.1). But 


T,.=%™|@0T.0::- OFT,@O1, if q>0 





and 
T,=T,OT.@0::-@T, if q= 0 
with 
(k) 
1 TE ee 
T, = i) + Non—1 
as? 
1 
Lie Ase 
for all k = 1, 2, --- , p. Therefore 7 'G(I + RG) « P, forall R = 


(D,T™* + D,) with D. = 0 and D, « S if and only if M~’G « P, for all 
Me3(T). O 


3.31 Definition 12: Let 5; denote the set of all F(-) such that 


(i) F(-)e 5Y?*® , and 
(ii) for each $j = 1, 2, \ldots, (2p + q)$ there exists a real constant
$\beta_j$ such that $f_j(\cdot)$ is a strictly-monotone-increasing mapping of
$E^1$ onto either $(\beta_j, \infty)$ or $(-\infty, \beta_j)$, and
(iii) whenever $p > 0$, $f_{(2k-1)}(\cdot)$ and $f_{2k}(\cdot)$ are both bounded on either
$[0, \infty)$ or $(-\infty, 0]$ for $k = 1, 2, \ldots, p$.


3.32 Theorem 13: Let T © 3, and, referring to the network of Fig. 1 in 
which it is assumed that R (see Section 2.1) ts the zero matrix, let G denote 
the short-circuit conductance matrix of the linear portion of the network. 
(The linear portion 1s assumed to contain only sources and linear resistors 
of nonnegative resistance.) Then the equation F(x) + T'Gx = B pos- 
sesses a unique solution x for each F(-) © 3 and each Be E”*® if and 
only if T'G e Py and det G # 0. If T~’Ge P, and det G = 0, then there 
exists a real (2p + q)-vector n such that (2) n ¥ 6, and for some F(-) € 5 
all of the components of F(an) are bounded on ae [0, ©), and (tz) for any 
F(-) © Ss with the property that all of the components of F(an) are bounded 
on ae (0, ©) the equation F(x) + T-'Gx = B does not possess a solution 
for some Be E°?*® , 
Fig. 1—General network containing transistors, diodes, resistors, and sources. 


3.33 Proof of Theorem 18 


(if) If T-’G « Py with det T-'G # 0, and if F(-) « $5, then, since 
each f;(-) is a strictly-monotone-increasing mapping of E' onto (6; , ©) 
or (— ©, 8;) for some real constant 6; , by Theorem 4 of Ref. 2, the 
equation F(x) + T’Gx = B possesses a unique solution x for each 
B © Erte | 

(only if) Assume that 7”'’G ¢ P,. Then since $; is contained in 
§??*9 by Theorem 1 of Ref. 3, for each F(-) e Fs there existsa Be H°?*” 
such that there are at least two solutions x of F(x) + T'Gz = B. 

Assume now that 7’’G e« P, and that det G = 0. We shall use the 
proposition that if R(-) is any continuous mapping of E“°’*” into itself, 
then R(-) is a homeomorphism of H°’*” onto itself if and only if R(-) 
is a local homeomorphism on E°”*” and || R(x) || > ~ as || x || > o.* 

Let R(-) be defined by the condition that R(x) = F(x) + T”’Gzx for 
allze H°’*® . For any F(-) « 3 the operator R(-) is a local homeo- 
morphism on £°”*® , since with F(-) such that each f;(-) is a strietly- 
monotone-increasing mapping of EZ" onto HE’ the mapping [F(-) + T7’G] 
is a homeomorphism of E°’*” onto itself.’ In addition, for any F(-) e F; 
and any Be E°”'® , there is at most one ze £*”*® such that R(x) = B.’ 

† See Ref. 12 and the appendix of Ref. 13.

Let us suppose that for each $B \in E^{(2p+q)}$ and each $F(\cdot) \in \mathcal{F}_3$ there
exists a solution $x$ of $R(x) = B$. Then for all $F(\cdot) \in \mathcal{F}_3$, $R(\cdot)$ is a
homeomorphism of $E^{(2p+q)}$ onto itself, and hence for all $F(\cdot) \in \mathcal{F}_3$,
$\|R(x)\| \to \infty$ as $\|x\| \to \infty$. But, by Lemma 3 (which appears below) $E^{(2p+q)}$ contains
a vector $\eta$ such that $\eta \neq \theta$, $\eta_j \in \{0, +1, -1\}$ for all $j$, and $G\eta = \theta$; and if
$p > 0$, $\eta$ satisfies $\eta_{(2k-1)}\eta_{2k} \ge 0$ for all $k = 1, 2, \ldots, p$. Let $\mathcal{F}_3(\eta)$ denote
the subset of $\mathcal{F}_3$ containing all elements $F(\cdot)$ with the property that
$f_j(\alpha\eta_j)$ is bounded on $\alpha \in [0, \infty)$ for all $j = 1, 2, \ldots, (2p + q)$. Since
$\eta_{(2k-1)}\eta_{2k} \ge 0$ for all $k = 1, 2, \ldots, p$ when $p > 0$, it is clear that $\mathcal{F}_3(\eta)$
is not empty. However, for any $F(\cdot) \in \mathcal{F}_3(\eta)$ we have $\|R(\alpha\eta)\| =
\|F(\alpha\eta)\|$ with $\|F(\alpha\eta)\|$ bounded on $\alpha \in [0, \infty)$, which contradicts the
assumption that there exists a solution $x$ of $R(x) = B$ for each $F(\cdot) \in \mathcal{F}_3$
and each $B \in E^{(2p+q)}$.


Lemma 3: Let $G$ be the short-circuit conductance matrix of the linear
portion of the network of Fig. 1. If $\det G = 0$, then there exists a vector
$\eta \in E^{(2p+q)}$ such that $G\eta = \theta$, $\eta \neq \theta$, and $\eta_j \in \{0, +1, -1\}$ for all
$j = 1, 2, \ldots, (2p + q)$; and if $p > 0$, $\eta$ also satisfies $\eta_{(2k-1)}\eta_{2k} \ge 0$ for
$k = 1, 2, \ldots, p$.


Proof of Lemma 3:


Let $N$ denote the $(2p + q)$-port resistor network obtained from the
network of Fig. 1 by removing all transistors and diodes and by setting
the value of each source to zero. The short-circuit conductance matrix
$G$ possesses the property that if $v \in E^{(2p+q)}$ denotes the vector of port
voltages of $N$ and $i \in E^{(2p+q)}$ denotes the corresponding vector of port
currents (with polarities as indicated in Fig. 1), then $i = -Gv$.

Let $\det G = 0$. Then the open-circuit resistance matrix of $N$ does not
exist. Therefore there exists a port $\ell$ of $N$ such that there is no path
through resistors of $N$ that connects the two terminals of port $\ell$ when
all other ports are open-circuited. Let a one-volt source be placed at
port $\ell$ so that $v_\ell = 1$. Then when all ports $j$ of $N$ with $j \neq \ell$ are
open-circuited, $i_\ell = 0$ and there is zero current in every resistor of $N$. Let $S$
denote a set of port numbers of $N$ with the following properties. The
number $\ell$ is not contained in $S$ and when all ports $j$ with $j \in S$ are
short-circuited and all ports $j$ with $j \notin S \cup \{\ell\}$ are open-circuited then zero
current flows through the one-volt source; when any port $j_1 \notin S \cup \{\ell\}$
and all ports $j$ with $j \in S$ are short-circuited and all ports $j$ with
$j \notin S \cup \{\ell, j_1\}$ are open-circuited then nonzero current flows through the
one-volt source. It is clear that such a set $S$ exists (with the understanding
that $S$ might be the null set). In general $S$ contains $r$ port numbers
where $0 \le r \le (2p + q - 1)$.

If $r = (2p + q - 1)$, then with $v_\ell = 1$ and with all remaining components
of $v$ equal to zero, we have $Gv = \theta$. Obviously in this case we
can take the vector $\eta$ of the statement of Lemma 3 to be $v$.

If $r \neq (2p + q - 1)$, then, with $v_\ell = 1$, with $v_j = 0$ for all $j \in S$,
and with all ports $j \notin S \cup \{\ell\}$ open-circuited, there exists for each
$j \notin S \cup \{\ell\}$ some path through the one-volt source and the resistors of
$N$ that connects the two terminals of port $j$. Therefore when
$r \neq (2p + q - 1)$, when all ports $j \notin S \cup \{\ell\}$ are open-circuited, when $v_\ell = 1$, and
when $v_j = 0$ for all $j \in S$, the open-circuit voltage $v_j$ at each port $j$ with
$j \notin S \cup \{\ell\}$ is well defined and nonzero. Since no current flows in any
resistor of $N$ when $v_\ell = 1$, $v_j = 0$ for all $j \in S$, and all ports $j \notin S \cup \{\ell\}$
are open-circuited, it follows that $v_j \in \{-1, +1\}$ for all $j \notin S$. With
$v_\ell = 1$, with $v_j = 0$ for all $j \in S$, and with $v_j$ the corresponding
open-circuit voltage for each $j \notin S \cup \{\ell\}$, we have $Gv = \theta$. When $p > 0$, the
vector $v$ also satisfies the condition that $v_{(2k-1)}v_{2k} \ge 0$ for all
$k = 1, 2, \ldots, p$ since if $v_{(2k-1)}v_{2k}$ were negative for some $k$, then for that
$k$, $v_{(2k-1)} = 1$ and $v_{2k} = -1$ or $v_{(2k-1)} = -1$ and $v_{2k} = 1$; in either case
$|v_{(2k-1)} - v_{2k}| = 2$, which contradicts the proposition that a network
of nonnegative resistors can have no voltage gain. □


APPENDIX* 


A theorem due to R. S. Palais† asserts that if $R(\cdot)$ is a continuously-
differentiable mapping of $E^n$ into itself with values $R(q)$ for $q \in E^n$,
then $R(\cdot)$ is a diffeomorphism‡ of $E^n$ onto itself if and only if


(i) $\det J_q \neq 0$ for all $q \in E^n$, in which $J_q$ is the Jacobian matrix of
$R(\cdot)$ with respect to $q$, and
(ii) $\|R(q)\| \to \infty$ as $\|q\| \to \infty$.


If $R(\cdot)$ is any twice-continuously-differentiable mapping of $E^n$ into
itself such that conditions (i) and (ii) of Palais' theorem are satisfied,
then $E^n$ contains a unique element $x$ such that $R(x) = \theta$ in which $\theta$
is the zero element of $E^n$, and there are steepest descent as well as
Newton-type algorithms each of which generates a sequence in $E^n$ that
converges to $x$. To show this, let $f(y) = \|R(y)\|^2$ for all $y \in E^n$ in
which $\|\cdot\|$ denotes the usual Euclidean norm (i.e., the square-root
of the sum of squares). Since condition (i) of Palais' theorem is satisfied,
the gradient $\nabla f$ of $f(\cdot)$ satisfies $(\nabla f)(y) \neq \theta$ unless $f(y) = 0$,§ and
since condition (ii) of Palais' theorem is satisfied, the set
$S = \{y \in E^n : f(y) \le f(x^{(0)})\}$ is bounded for any $x^{(0)} \in E^n$. Therefore we may appeal to,
for example, the theorem of page 48 of Ref. 18 according to which for
any $x^{(0)} \in E^n$, for any member of a certain class of mappings $\varphi(\cdot)$ of $S$
into $E^n$, and for suitably chosen constants $\gamma_0, \gamma_1, \ldots$, the sequence
$x^{(1)}, x^{(2)}, \ldots$ defined by

$$
x^{(k+1)} = x^{(k)} + \gamma_k\,\varphi(x^{(k)}) \quad \text{for all } k \ge 0
$$

belongs to $S$ and is such that $\|R(x^{(k)})\| \to 0$ as $k \to \infty$. However,
since $R^{-1}(\cdot)$ exists and is continuous, it follows from

$$
x^{(k)} = R^{-1}[R(x^{(k)})] \quad \text{for all } k \ge 0
$$

and the fact that $R(x^{(k)}) \to \theta$ as $k \to \infty$, that $\lim_{k \to \infty} x^{(k)}$ exists and

$$
\lim_{k \to \infty} x^{(k)} = R^{-1}(\theta),
$$

which means that $\lim_{k \to \infty} x^{(k)}$ is the unique solution $x$ of $R(y) = \theta$.

* The material of this appendix together with some misprints appears in Ref. 3.

† See Ref. 12 and the appendix of Ref. 13.

‡ A diffeomorphism of $E^n$ onto itself is a continuously differentiable mapping of
$E^n$ into $E^n$ which possesses a continuously differentiable inverse.

§ Here we have used the fact that $(\nabla f)(y) = 2J_y^{tr}R(y)$ for all $y \in E^n$.
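
The iteration described in this appendix can be sketched in a few lines. The example below is an illustration only: the map `R` on $E^2$ is hypothetical (chosen so that $\det J_y \ge 0.75$ and $\|R(y)\| \to \infty$ as $\|y\| \to \infty$), the direction is taken as $\varphi(x) = -(\nabla f)(x)$, and the constants $\gamma_k$ are chosen by simple backtracking, which is one possible rule and not necessarily the one in the cited reference.

```python
import math

def R(y):
    # Hypothetical smooth map on E^2 satisfying conditions (i) and (ii).
    return [y[0] + 0.5 * math.sin(y[1]) - 1.0,
            y[1] + 0.5 * math.sin(y[0]) + 0.5]

def jacobian(y):
    # Jacobian matrix J_y of R; its determinant is 1 - 0.25*cos(y0)*cos(y1).
    return [[1.0, 0.5 * math.cos(y[1])],
            [0.5 * math.cos(y[0]), 1.0]]

def f(y):
    # f(y) = ||R(y)||^2 with the Euclidean norm.
    r = R(y)
    return r[0] ** 2 + r[1] ** 2

def grad_f(y):
    # (grad f)(y) = 2 * J_y^T R(y)
    J, r = jacobian(y), R(y)
    return [2.0 * (J[0][0] * r[0] + J[1][0] * r[1]),
            2.0 * (J[0][1] * r[0] + J[1][1] * r[1])]

def steepest_descent(x0, tol=1e-9, max_iter=5000):
    """Iterate x <- x - gamma * grad f(x) until ||R(x)|| < tol."""
    x = list(x0)
    for _ in range(max_iter):
        fx = f(x)
        if math.sqrt(fx) < tol:
            break
        g = grad_f(x)
        g2 = g[0] ** 2 + g[1] ** 2
        gamma = 1.0
        # Backtracking: halve gamma until a sufficient decrease of f is obtained.
        while gamma > 1e-12:
            trial = [x[0] - gamma * g[0], x[1] - gamma * g[1]]
            if f(trial) <= fx - 1e-4 * gamma * g2:
                break
            gamma *= 0.5
        else:
            break  # no acceptable step found; stop iterating
        x = trial
    return x

x_a = steepest_descent([5.0, -7.0])
x_b = steepest_descent([-3.0, 4.0])
```

Since the zero of $R$ is unique under Palais' conditions, runs started from different points should arrive at the same limit, which is what comparing `x_a` and `x_b` checks.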
Characterization of Second-Harmonic 


Effects in IMPATT Diodes 


By C. A. BRACKETT 
(Manuscript received May 20, 1970) 


We discuss characterization of the tuned-harmonic mode of operation in
IMPATT oscillators, and introduce an equivalent circuit which incorporates
the large-signal, "single-frequency" oscillator admittances at the
fundamental and second-harmonic frequencies. Complete characterization
of this mode is equivalent to specifying the behavior of each of the four
elements of the equivalent circuit as functions of the oscillation state
variables: fundamental voltage and frequency, second-harmonic voltage and
relative phase. Using the approximate large-signal analysis of Blue, the
values of the equivalent circuit elements are presented, as an example, for
a 6-GHz IMPATT diode under a variety of oscillation conditions. This
equivalent circuit is used to clarify the role played by the fundamental and
second-harmonic, single-frequency oscillator admittances in the
tuned-harmonic mode.

Using an approximation to the equivalent circuit, we investigate the
criteria for stable oscillation of the tuned-harmonic mode. It is found that
the stability criteria are in general quite restrictive. For the same 6-GHz
germanium diode, the range of stable phase is investigated, as a function
of the RF parameters, for certain special cases. It is found to be possible
to satisfy the stability criteria for the phase which gives an optimum
enhancement of the fundamental power output if certain conditions on the
external RF circuit are satisfied.


I. INTRODUCTION 


It was found by Swan’ that the introduction of a trapped resonance 
at the second harmonic of the oscillation frequency in a 6-GHz Ge 
IMPATT diode oscillator provided dramatic increases in the output 
power and efficiency, as compared with the results obtained with the 
ordinary single quarter-wave transformer coaxial circuit. Since that 
time several authors have reported both theoretical and experimental
examinations of the effect. It appears that the addition of a
properly phased second-harmonic voltage improves the phasing of the 
RF current relative to the fundamental voltage so as to increase the 
negative conductance and (at least at lower frequencies) give an in- 
crease in the power output at the fundamental frequency. The circuit 
conditions required for the observation of this effect have been incom- 
pletely understood. 

The purpose of this paper is to present the results of an analytical 
study of the interaction of an IMPATT diode with a circuit having 
resonances at two harmonically related frequencies. The analysis is 
begun by the introduction of an equivalent circuit for the diode by 
which these two-frequency oscillators may be characterized. A stability 
theory is then developed along the lines taken by Kurokawa which 
examines whether a particular circuit, even though matching the 
impedances required by the diode at both frequencies, will or will not 
provide a stable oscillation.’ The stability theory is examined in 
some generality, and three special cases are studied for which tractable 
analytical results can be obtained. It is found that in the case of zero 
fundamental or second-harmonic voltage, the theory reduces to the 
single-frequency stability criteria derived by Kurokawa. In more 
general cases, the theory indicates that by designing (or adjusting) 
the circuit carefully one can obtain stable operation at phase angles 
which enhance the fundamental power. However, the theory also 
indicates that stable operation may be impossible if the circuit-diode 
interaction is not just right, even though the diode and circuit are 
matched to each other at the two frequencies. 

In a final section, a numerical example is given in which the theory 
is applied to a model of a 6-GHz germanium IMPATT diode, using 
the approximate large-signal analysis of Blue." 


II. TWO-FREQUENCY CHARACTERIZATION 


The IMPATT oscillator is truly a single-frequency oscillator only at
very small ac voltages and currents. At larger signal levels the
nonlinearity is very strong, and therefore there should be strong
interactions between harmonically related signals. However, by operating
the diode in a well-designed single-frequency circuit, the power output
may be limited to a single frequency. This may be done, for example,
by presenting short-circuit, open-circuit, or reactive loads at the
harmonic frequencies. In the case of short-circuited harmonics, the
harmonic voltage amplitudes $V_n$ are zero, and only the fundamental
voltage $V_1$ is nonzero. It is then common practice to calculate a
large-signal diode admittance as a function of $V_1$ and to use this admittance
to describe device behavior. On the other hand, for the case of
open-circuited harmonics, the harmonic currents $I_n$ are zero, and only the
fundamental current $I_1$ is nonzero. It is then preferable to characterize
the diode by a large-signal impedance which is a function of the RF
current amplitude $I_1$. Both of these conditions constitute tunings at
the harmonic frequencies, albeit ones that are particularly useful and
simple to express analytically.

To consider other, more general, loading conditions at the harmonic
frequencies, one must introduce two more variables (amplitude and
phase) for each additional frequency for which the amplitude is nonzero.
One of the most important points is that the input admittance (for
example) at the fundamental frequency is no longer a unique function
of $V_1$ and the frequency $f$; instead, defining the state of oscillation
requires a vector whose components are $V_1, \ldots, V_N, f, \varphi_2, \ldots, \varphi_N$,
where $N$ is the maximum harmonic number of interest and $\varphi_k$ is the
phase of the $k$th harmonic voltage relative to the fundamental. This
vector does uniquely describe the state of oscillation, and for every
such vector, there exists a set of complex admittances $y_1, \ldots, y_N$ which
are uniquely determined. If this is not so, it simply means we have
inadequately described the system and must include more component
signals, either harmonics or subharmonics.

We shall limit the discussion to include only two harmonically related
frequencies and consider that $V_k = 0$ for $k > 2$. This also means that
we will only discuss the admittance characterization and not the
impedance characterization.

A convenient way of utilizing the information already known about
the large-signal single-frequency admittance of the diode is to separate
the input admittances at the two frequencies as shown in Fig. 1. This
equivalent circuit shows a fundamental port and a second-harmonic
port. The admittances $y_{11}(V_1)$ and $y_{22}(V_2)$ are the large-signal
single-frequency admittances that would be measured at the fundamental if
there were no harmonic (or subharmonic) voltages present. That is,
they are just the ordinary large-signal admittances $y(V)$ at the
frequencies $f$ and $2f$.

Fig. 1—Equivalent circuit of the IMPATT diode which includes nonzero voltages
at two harmonically related frequencies. Port 1 is the fundamental port and port 2
is the second-harmonic port; $y_{11}$ and $y_{22}$ are the large-signal single-frequency diode
admittances at the fundamental and second-harmonic frequencies, respectively.

The admittances $y_{12}(V_1, V_2, f, \varphi_2)$ and $y_{21}(V_1, V_2, f, \varphi_2)$ account
for the conversion of current between the two frequencies and it is the
study of their effects that is the main subject of this paper. The phase
$\varphi_2$ is defined by the assumed voltage waveforms

$$
v_1(t) = V_1 \cos \omega_0 t
$$

and

$$
v_2(t) = V_2 \cos(2\omega_0 t + \varphi_2).
$$

The input admittances are

$$
Y_{in1} = y_{11} + y_{12}\frac{V_2}{V_1}\exp(j\varphi_2) \tag{1}
$$

and

$$
Y_{in2} = y_{22} + y_{21}\frac{V_1}{V_2}\exp(-j\varphi_2) \tag{2}
$$

at the fundamental and second-harmonic frequencies respectively.
Since $y_{11}$ and $y_{22}$ are independent of the phase $\varphi_2$ by definition, equations
(1) and (2) show that the input admittance loci for fixed $V_1$ and $V_2$ will
be counter-rotating closed curves as a function of $\varphi_2$. These curves will
enclose the admittance points $y_{11}$ and $y_{22}$ separately providing that
$y_{12}$ and $y_{21}$ are not strong functions of $\varphi_2$. If, for example, $y_{12}$ and $y_{21}$
are independent of $\varphi_2$, $Y_{in1}$ and $Y_{in2}$ will be circles centered about
$y_{11}$ and $y_{22}$ respectively, the radii of which depend upon the ratio
$V_2/V_1$. They generally turn out to be somewhat elliptical in shape*
although, in many cases, of very low eccentricity.
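
The circular-locus statement can be checked directly from equations (1) and (2). The sketch below assumes $y_{12}$ and $y_{21}$ are constants independent of $\varphi_2$; every numerical value is hypothetical and chosen only for illustration, not taken from the diode data of this paper.

```python
import cmath
import math

# Hypothetical element values (siemens) and voltage amplitudes (volts).
y11 = complex(-2.0e-3, 8.0e-3)
y22 = complex(-0.5e-3, 15.0e-3)
y12 = complex(1.0e-4, -2.0e-4)
y21 = complex(-3.0e-5, 1.0e-5)
V1, V2 = 10.0, 1.0

def y_in1(phi2):
    # Equation (1): input admittance at the fundamental port.
    return y11 + y12 * (V2 / V1) * cmath.exp(1j * phi2)

def y_in2(phi2):
    # Equation (2): input admittance at the second-harmonic port.
    return y22 + y21 * (V1 / V2) * cmath.exp(-1j * phi2)

# Distance of each locus point from its center as phi2 sweeps 0 to 2*pi.
phases = [2.0 * math.pi * k / 360 for k in range(360)]
radii1 = [abs(y_in1(p) - y11) for p in phases]
radii2 = [abs(y_in2(p) - y22) for p in phases]

def rotation_sense(locus, center, p, dp=1e-3):
    """+1 if the locus turns counterclockwise about center as phi2 increases, else -1."""
    a = cmath.phase((locus(p + dp) - center) / (locus(p) - center))
    return 1 if a > 0 else -1
```

With constant $y_{12}$ and $y_{21}$ the radii are $|y_{12}|V_2/V_1$ and $|y_{21}|V_1/V_2$, and the two loci are traversed in opposite senses; any $\varphi_2$ dependence of $y_{12}$ or $y_{21}$ would distort the circles toward the elliptical shapes described in the text.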

Figure 2 is the calculated’ large-signal, single-frequency, admittance 
plane plot for a 6-GHz germanium diode, from which y,, and 2.5 may 
be obtained directly. Figures 3 and 4 show Y;,; and Y,,2 for various 
fundamental frequencies when the voltages are held constant, demon- 
strating the elliptical and circular behavior noted above. Note that in 
Fig. 4 the second-harmonic input admittance has a positive real part 
for some ranges of the phase ¢.. To operate at such phase angles and 
RF voltages, the external circuit must supply power to the diode at 
the second-harmonic frequency, and thus these conditions are un- 
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Fig. 2—The calculated large-signal single-frequency admittance of a 6-GHz 
germanium IMPATT diode at a bias current density Jo = 340 A/cm? 


realizable when operating into a passive circuit. The diameter of these 
admittance contours is inversely proportional to the second-harmonic 
voltage amplitude V,, however, so that at higher values of V2, the 
entire contour may lie in the left-half plane. 

Fig. 3—The input admittance, Yin1, at 3, 4, 5 and 6 GHz as it is modified by
the presence of a second-harmonic voltage for V1 = 10 volts, V2 = 1 volt and
J0 = 340 A/cm².

The rather simple structure of the Yin1 and Yin2 loci of Figs. 3 and 4
suggests that y12 and y21 might be rather insensitive functions of φ2.
This is borne out by the plots of Fig. 5 in which y12 and y21 are shown
at constant fundamental voltage V1 and several values of V2, with φ2
ranging 0 ≤ φ2 ≤ 2π. This figure also establishes that y12 and y21 do
not change drastically as a function of V2. It was also found that y12
and y21 depend upon V1 in an approximately linear fashion. This is
shown in Fig. 6 where y12/V1 and y21/V1 are plotted versus V1 for
several values of φ2 with V2 constant. Thus, for moderate values of
V1 and V2, we can make the approximation that y12 and y21 are both
proportional to V1 and independent of φ2 and V2. To demonstrate this
analytically, let the phase of the fundamental voltage φ1 ≠ 0, and con-
sider a power series expansion of the currents i12 = y12 V2 exp (jφ2) and
i21 = y21 V1 exp (jφ1). Selecting the lowest-order terms having the
appropriate frequencies, we find that

    y_{12} \propto V_1 \exp(-j\varphi_1)
and                                                                  (3)
    y_{21} \propto V_1 \exp(j\varphi_1)

which confirms the approximate linear dependence on V1 and gives
the appropriate form of the φ1 dependence. It will be convenient later
to approximate y12 and y21 by the quantities

    \bar{y}_{12} = K_1 V_1 \exp(-j\varphi_1) = \kappa_1 V_1 \exp[-j(\varphi_1 - \psi_1)],
                                                                     (4)
    \bar{y}_{21} = K_2 V_1 \exp(j\varphi_1) = \kappa_2 V_1 \exp[j(\varphi_1 + \psi_2)],

where κ1 = |K1|, κ2 = |K2|, ψ1 = arg (K1) and ψ2 = arg (K2). Note
that for φ1 = 0 (only the phase φ2 − φ1 is important), ψ1 = arg (y12)
and ψ2 = arg (y21), which is what will usually be assumed.

The quantities ȳ12 and ȳ21 may be defined as the average of y12 and
y21 over the phase φ2. For the 6-GHz oscillator example, the calcu-
lated values of ȳ12 and ȳ21 as a function of frequency are shown in Figs.
7 and 8 and the phases ψ1, ψ2 and ψ1 + ψ2 are shown in Fig. 9. Obvi-
ously these are only first-order approximations, but the complexity of
the stability analysis requires some suitable approximation to obtain
qualitative understanding.

Fig. 4—The input admittance, Yin2, at the second harmonic of 3, 4, 5 and 6 GHz
for the same conditions as Fig. 3.

Fig. 5—Complex plane plot of y12 and y21 for V1 = 10 volts and V2 = 2, 6, 10, 14
volts at 6 GHz, showing the relative insensitivity of y12 and y21 to changes in V2
and φ2 for moderate values of V2. (Vertical axis: imaginary part in mhos/cm².)

The interaction of the diode equivalent circuit of Fig. 1 with an
external circuit can be visualized by connecting an admittance Y2 to
the second-harmonic port. The fundamental input admittance is then

    Y_{in1} = y_{11} - \frac{y_{12} y_{21}}{y_{22} + Y_2}.    (5)


Tuning the second harmonic by adjusting Y2 provides the possibility
of almost any input admittance Yin1. In particular, |Y2| = ∞ gives
the short-circuit termination and Yin1 = y11. Equation (5) also pre-
dicts a pole in Yin1 at the frequency for which y22 + Y2 = 0. This is
not an ordinary pole as in linear circuit theory, however, for two reasons:
(i) y22 may have a negative real part because it is an active device, and
(ii) y22 is a function of V2 so that the “pole” at y22 + Y2 = 0 moves
with changing V2. This means that a resonance type of behavior should
be observed, but that the only condition where y22 + Y2 = 0 is for
V1 = 0, which is just the single-frequency oscillator condition at 2f.
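The behavior of equation (5) at its two extremes can be checked with a short sketch. The function name `fundamental_input_admittance` and all numerical values are hypothetical, and y12, y21 and y22 are held fixed here, whereas in the diode they vary with V1 and V2, so this is only a linear caricature of the "pole".

```python
def fundamental_input_admittance(y11, y12, y21, y22, y2_load):
    """Equation (5): Yin1 = y11 - y12*y21 / (y22 + Y2)."""
    return y11 - (y12 * y21) / (y22 + y2_load)

# Hypothetical two-port parameters (mhos/cm^2), for illustration only.
Y11, Y12, Y21, Y22 = -10 + 20j, 1 + 1j, 0.5 - 0.5j, -5 + 40j

# |Y2| very large: second-harmonic port short-circuited, Yin1 tends to y11.
short_circuit = fundamental_input_admittance(Y11, Y12, Y21, Y22, 1e12)

# Y2 close to -y22: the denominator nearly vanishes and |Yin1| becomes large.
near_pole = fundamental_input_admittance(Y11, Y12, Y21, Y22, -Y22 + 1e-3)
```

In the real device the second case cannot be reached with V1 ≠ 0, for the reasons given in the paragraph above; the sketch shows only why a resonance-like response is expected near that condition.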
III. STABILITY OF THE OSCILLATION STATE


Given an oscillation state which prescribes the admittances at the 
two frequencies, there are two requirements on the circuit that must 
be met in order that this be an obtainable state of steady oscillations. 
These are the requirements of circuit realizability and oscillation-state 
stability. The realizability criterion is simply that the required circuit 
have admittances whose real parts are greater than zero. The stability 
criterion is that any perturbation away from the given state will asymp- 
totically return to the original state. 

The stability problem has been recently discussed by Kurokawa’’’® 
for the single-frequency negative-resistance oscillator. By following the
approach used by Kurokawa and extending it to two-frequency inter-
actions, the equations governing the stability of the harmonically
tuned oscillator are derived in Appendix A. In this section, they are
applied to several special cases, and theoretical examples of their use
with the 6-GHz germanium oscillator model of Blue are given in
Section IV.

Fig. 6—Complex plane plot of K1 = y12/V1 and K2 = y21/V1 as a function of V1
for various values of second-harmonic phase φ2, at 4 GHz.

Fig. 7—Complex plane plot of ȳ12 and ȳ21 as a function of the fundamental
frequency for V1 = 10 volts and V2 = 1 volt.

Fig. 8—|ȳ12| and |ȳ21| versus frequency for V1 = 10 volts and V2 = 1 volt.

Fig. 9—The arguments ψ1, ψ2 and ψ1 + ψ2 versus frequency, showing a nearly
linear dependence.

In Appendix A, it is shown that the stability of an oscillation-state
for small perturbations is determined by the solution of the system of
equations

    \frac{d\varepsilon}{dt} + B\varepsilon = 0    (6)

where the vector ε is defined as

    \varepsilon = \begin{pmatrix} \delta a_1 / V_1 \\ \delta a_2 / V_2 \\ \delta(\varphi_2 - 2\varphi_1) \end{pmatrix}    (7)


and the matrix B is given by 








Me Gia sin (wy — 4) 
| ah Ages (az + 620) 
=D Sve Gio ore v0 
Ko Vy 
+7 V7 COs (a2 + 420) | | 











Ki Vo V 
¥ [sin (a, + 640) “Ty 7 Cos (a, + 640) ] 
ee a ee AU cos (2 + O20) 
| Y2 | | Yo | 
Mul + 0 Goo cos (a, — va] _ 2k V2 sin (a, + a] 
St Ye | Ree 
— 2k Ve V, | - KoV, . | 
TY] Y7 | cos (a; + 410) TY] Y? bs (a2 + 620) | 


(8) 


As discussed in the Appendix, δa1, δa2 and δ(φ2 − 2φ1) are the
perturbations in the fundamental and second-harmonic voltage ampli-
tudes and the relative phase, respectively. V1 and V2 are the unper-
turbed values of fundamental and second-harmonic voltage amplitudes.

The remaining quantities in the B matrix are defined as follows. The
fundamental and second-harmonic external circuit admittances are
Y1(ω0) = G10 + jB10 and Y2(2ω0) = G20 + jB20, respectively. The
primes on Y1 and Y2 in equation (8) denote differentiation with respect
to frequency at ω0 and 2ω0 respectively. κ1 and κ2 are defined in equation
(4).

The saturation parameters s, r and u, v are defined by equations 
(55) through (58) in the Appendix. They relate to the nonlinear satura- 
tion of the diode’s conductance and susceptance at the fundamental 
and second harmonic frequencies, respectively. The significance of s 
and r is shown schematically in Fig. 10, with u and v interpreted by a 
similar diagram for the second-harmonic admittance. 

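We have also introduced the angles α1 and α2 which give the slope
on the complex plane of the circuit admittances at ω0 and 2ω0,

    \cos\alpha_1 = \frac{G'_{10}}{\sqrt{G'^2_{10} + B'^2_{10}}}, \qquad \sin\alpha_1 = \frac{B'_{10}}{\sqrt{G'^2_{10} + B'^2_{10}}},    (9)

    \cos\alpha_2 = \frac{G'_{20}}{\sqrt{G'^2_{20} + B'^2_{20}}}, \qquad \sin\alpha_2 = \frac{B'_{20}}{\sqrt{G'^2_{20} + B'^2_{20}}},    (10)

and the angles γ1 and γ2 which measure the slope of the admittance
curves y11(V1) and y22(V2),

    \cos\gamma_1 = \frac{s}{\sqrt{s^2 + r^2}}, \qquad \sin\gamma_1 = \frac{r}{\sqrt{s^2 + r^2}},    (11)

    \cos\gamma_2 = \frac{u}{\sqrt{u^2 + v^2}}, \qquad \sin\gamma_2 = \frac{v}{\sqrt{u^2 + v^2}}.    (12)

Also, θ10 and θ20 are defined as in equations (48) and (49) of the Appendix
but with the phase φ1 set to zero. That is,

    \theta_{10} = -\varphi_2 - \psi_1
and                                                                  (13)
    \theta_{20} = \varphi_2 - \psi_2.

Note that

    \theta_{10} + \theta_{20} = -\psi_1 - \psi_2.    (14)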
Fig. 10—Interpretation of the saturation parameters s and r for the fundamental
admittance y11. Similar definitions hold for u and v for the second-harmonic ad-
mittance y22.


For the Ge oscillator considered here, the direct relationship between
φ2, θ20 and θ10, as determined from equation (13), is shown in Fig. 11
for several frequencies.

The angles α1, γ1 and θ10 are shown in Fig. 12 which is a plot of the
negative of an assumed circuit admittance −Y1(ω) and the diode
single-frequency admittance y11(V1) in the neighborhood of the funda-
mental frequency. The point of intersection gives the frequency
and amplitude of the fundamental oscillation with zero second-har-
monic voltage. As the voltage V2 is increased by presenting an appro-
priate value of Y2(2ω0), the frequency will shift to some new value
ω0 generally accompanied by a change in voltage to V1. This shows
that the current injected into the fundamental circuit by the y12 V2
exp (jφ2) current source of Fig. 1 is just that sufficient to obtain the
difference between the admittances −Y1(ω0) and y11(V1). This addi-
tional admittance may be considered as a vector pointing from y11(V1)
to −Y1(ω0), and it is the angle θ10 measured clockwise about the y11(V1)
point that determines the orientation of this vector. Its length is given
by |y12| V2/V1. The angle α1 gives the slope of the circuit curve at
−Y1(ω0), and the angle γ1 gives the relative change in reactive to real
part of y11(V1) with increasing voltage V1 at the operating point. The
angles α2, γ2 and θ20 may be defined in a similar manner in the second-
harmonic admittance plane.

The solution of equation (6) subject to a small initial perturbation 
has a decreasing amplitude with increasing time if the eigenvalues of 
the stability matrix B all have real parts greater than zero. Suitable 
tests have been devised to determine this property.’’ The general case 
is difficult to do analytically and generally difficult to interpret if done
numerically because of the large number of parameters of the system.
This is done, however, for the 6-GHz oscillator example given in Section
IV, and the results are compared with the simplified results of this
section.
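To make the decay criterion for equation (6) concrete, the following sketch integrates dε/dt = −Bε with a simple forward-Euler step. It is an illustration only: the function name `integrate_perturbation`, the step size, and both 3 × 3 matrices are hypothetical and are not the matrix B of equation (8).

```python
def integrate_perturbation(b, eps0, dt=1e-3, steps=5000):
    """Forward-Euler integration of d(eps)/dt = -B eps for a 3x3 matrix B."""
    eps = list(eps0)
    for _ in range(steps):
        # Right-hand side of equation (6) solved for the derivative.
        deriv = [-sum(b[i][j] * eps[j] for j in range(3)) for i in range(3)]
        eps = [eps[i] + dt * deriv[i] for i in range(3)]
    return eps

# Hypothetical matrix whose eigenvalues (1, 2, 3) all have positive real parts.
B_STABLE = [[1.0, 0.5, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
# Hypothetical matrix with one negative eigenvalue (-1).
B_UNSTABLE = [[-1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]

initial = [0.01, 0.01, 0.01]
decayed = integrate_perturbation(B_STABLE, initial)
grown = integrate_perturbation(B_UNSTABLE, initial)
```

The first perturbation shrinks toward zero while the second grows along the direction belonging to the negative eigenvalue, which is the distinction the eigenvalue test is meant to capture.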

In the remainder of this section, three special cases are examined
which are severe approximations to the general case, but which yield
interesting information. The first of these is that of a single-frequency
oscillator, V2 = 0. The second is the fictitious weak-coupling case
which does not apply to the germanium diodes modeled here, but is
included because of simplicity and for completeness. The third case is
that of a strongly coupled small-signal approximation which gives
qualitatively most of the features observed from the complete study
of the eigenvalues of B, which is carried out in Section IV for the ger-
manium diode case.

Fig. 11—Oscillator phase relations for the 6-GHz germanium example; φ2 versus
θ20 with loci of constant θ10 at 3, 4, 5, 6, 7 and 8 GHz. (Horizontal axis: θ20/π.)

Fig. 12—An assumed fundamental admittance plane plot showing the angles
α1, γ1 and θ10. The device admittance is y11(a1) and the negative of the circuit ad-
mittance is −Y1(ω). A similar diagram defines α2, γ2 and θ20 in the neighborhood
of 2ω0.


3.1 Single-Frequency Limit 


In the very special case of V2 = 0, only the first and third parts
of equation (6) remain and they give the conditions

    \sin(\alpha_1 - \gamma_1) > 0    (15)


and 





ae sin (a2 + 02) < 0. (16) 

| V2 | 
These are simply the conditions required for stability of a single-
frequency oscillator [equation (15)] with the added condition (16) due
to the coupling to the harmonic. If the coupling to the harmonic,
κ2, is zero for V2 = 0, equation (16) does not apply. Thus, for the
single-frequency oscillator with V2 = 0, the familiar stability relation is
recovered.”
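Condition (15) can be evaluated directly from the circuit slopes and the saturation parameters through the angle definitions (9) and (11). The sketch below is a minimal illustration; the function name `single_frequency_stable` and the sample inputs are hypothetical, and s and r are treated simply as given numbers because their defining equations are in the Appendix.

```python
import math

def single_frequency_stable(g_slope, b_slope, s, r):
    """Evaluate condition (15), sin(alpha1 - gamma1) > 0.

    g_slope, b_slope: dG/dw and dB/dw of the circuit at w0 (equation (9)).
    s, r: saturation parameters of the diode (equation (11)).
    """
    alpha1 = math.atan2(b_slope, g_slope)  # cos = G'/|Y'|, sin = B'/|Y'|
    gamma1 = math.atan2(r, s)              # cos = s/sqrt(s^2+r^2), sin = r/...
    return math.sin(alpha1 - gamma1) > 0.0

# Simple shunt resonant circuit (G' = 0, B' > 0), purely conductive saturation.
stable_case = single_frequency_stable(0.0, 1.0, 1.0, 0.0)
# Same circuit with the sign of s reversed (hypothetical, for contrast).
unstable_case = single_frequency_stable(0.0, 1.0, -1.0, 0.0)
```

Expanding the sine shows that the test is equivalent to B′s − G′r > 0, so only the relative orientation of the circuit slope and the device saturation direction matters.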

3.2 Weak-Coupling Limit


For an oscillator having very small κ1 and κ2, the first two parts of
equation (6) decouple. This gives

    \sin(\alpha_j - \gamma_j) > 0    (j = 1, 2)

which are the single-frequency stability conditions at ω0 and 2ω0 for
j = 1 and 2, respectively. The third equation then requires

    \sin(\alpha_1 + \theta_{10}) + \mu \sin(\alpha_2 + \theta_{20}) < 0    (17)

where the parameter yp is defined by 
ean Al a 

We may write equation (17) as

    \sin(\varphi_2 + \xi) < 0    (19)

where ξ is defined by the equations

    \rho \sin\xi = -[\sin(\psi_1 - \alpha_1) + \mu \sin(\psi_2 - \alpha_2)]    (20)

and

    \rho \cos\xi = -[\cos(\psi_1 - \alpha_1) - \mu \cos(\psi_2 - \alpha_2)].    (21)


For a given pair of V1, V2 and for a fixed circuit, equation (19) thus
gives the range of φ2 for stable operation in the weak coupling limit.
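The step from equation (17) to equation (19) can be verified numerically: substituting θ10 = −φ2 − ψ1 and θ20 = φ2 − ψ2 from equation (13), the left side of (17) equals ρ sin(φ2 + ξ) with ρ and ξ given by (20) and (21). The sketch below checks this; the function names and the angle values are hypothetical, and μ is simply supplied as a number.

```python
import math

def weak_coupling_phase(psi1, psi2, alpha1, alpha2, mu):
    """Return (rho, xi) from equations (20) and (21), with rho >= 0."""
    rho_sin = -(math.sin(psi1 - alpha1) + mu * math.sin(psi2 - alpha2))
    rho_cos = -(math.cos(psi1 - alpha1) - mu * math.cos(psi2 - alpha2))
    return math.hypot(rho_sin, rho_cos), math.atan2(rho_sin, rho_cos)

def lhs_eq17(phi2, psi1, psi2, alpha1, alpha2, mu):
    """Left side of equation (17) with theta10, theta20 from equation (13)."""
    theta10 = -phi2 - psi1
    theta20 = phi2 - psi2
    return math.sin(alpha1 + theta10) + mu * math.sin(alpha2 + theta20)

# Hypothetical angles (radians) and coupling ratio, for illustration only.
PSI1, PSI2, ALPHA1, ALPHA2, MU = -0.9, 2.1, 1.2, 0.7, 0.3
rho, xi = weak_coupling_phase(PSI1, PSI2, ALPHA1, ALPHA2, MU)
mismatch = max(
    abs(lhs_eq17(p, PSI1, PSI2, ALPHA1, ALPHA2, MU) - rho * math.sin(p + xi))
    for p in [0.0, 0.5, 1.0, 2.0, 3.0, 4.5, 6.0]
)
```

Because ρ is non-negative, the sign of the left side of (17) is the sign of sin(φ2 + ξ), so the stable range of φ2 is the half-period π < φ2 + ξ < 2π (modulo 2π).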


3.3 Small-Signal, Strong-Coupling Limit 


For very small signals the admittances y11 and y22 are independent
of V1 and V2 so that s = r = u = v = 0 provides another approxima-
tion of some interest, provided that the coupling is still significant.
In this limit, we obtain four constraints which are necessary and
sufficient’ to insure that the eigenvalues of the matrix B have positive
real parts. These are

    k_1 = -\sin(\alpha_1 + \theta_{10}) - \mu \sin(\alpha_2 + \theta_{20}) > 0,    (22)

    k_2 = -\sin(\alpha_1 + \theta_{10}) \sin(\alpha_2 + \theta_{20})
          + 3 \cos(\alpha_1 + \theta_{10}) \cos(\alpha_2 + \theta_{20}) > 0,    (23)

    k_3 = \sin(\alpha_2 + \theta_{20}) + \mu \sin(\alpha_1 + \theta_{10}) > 0,    (24)

    k_4 = k_1 k_2 - k_3 > 0,    (25)

where μ is defined by equation (18).
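The four constraints (22) through (25) are easy to evaluate for trial angles. The following sketch is a minimal illustration; the function name `small_signal_constraints` and the trial values are hypothetical, and μ is passed in as a number rather than computed from equation (18).

```python
import math

def small_signal_constraints(alpha1, alpha2, theta10, theta20, mu):
    """Return (k1, k2, k3, k4) of equations (22) through (25)."""
    a1 = alpha1 + theta10
    a2 = alpha2 + theta20
    k1 = -math.sin(a1) - mu * math.sin(a2)
    k2 = -math.sin(a1) * math.sin(a2) + 3.0 * math.cos(a1) * math.cos(a2)
    k3 = math.sin(a2) + mu * math.sin(a1)
    k4 = k1 * k2 - k3
    return k1, k2, k3, k4

# Hypothetical trial point with weak relative coupling (mu = 0.05).
trial = small_signal_constraints(-2.0, 2.5, 0.0, 0.0, 0.05)
trial_is_stable = all(k > 0.0 for k in trial)

# At mu = 1 the first and third constraints are exact negatives of each other.
k1_unity, _, k3_unity, _ = small_signal_constraints(0.4, 1.3, -0.2, 0.9, 1.0)
```

The second evaluation shows numerically that k1 = −k3 when μ = 1, so the two constraints cannot both be positive at that value.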

The significance of this case is that for μ = 1, conditions k1 > 0
and k3 > 0 are contradictory, since equations (22) and (24) give
k1 = −k3 when μ = 1. This implies that μ = 1 is a critical
value and is indeed unstable, whereas for μ approaching zero or in-
finity stable states of oscillation do exist. These μ ≪ 1 and μ ≫ 1
stable states are exclusive of each other so that, as the conditions of
oscillation are changed, if μ passes through the value unity a discon-
tinuity in the oscillation will occur wherein the phase, the power and
the frequency may all jump suddenly to new values.

To demonstrate the existence and exclusive nature of the μ < 1
and μ > 1 limits, consider equations (22) through (25). Note first of
all that if a solution is obtained for a given value of μ, the solution for
the reciprocal of that value of μ is obtained by interchanging the sub-
scripts 1 and 2 on the angles α and θ. Thus, we need only consider the
limit μ < 1, the limit μ > 1 being obtained from symmetry. For μ <
1, equations (22) and (24) yield [using equation (13)]


    \pi - \psi_1 + \alpha_1 < \varphi_2 < 2\pi - \psi_1 + \alpha_1    (k_1 > 0)    (26)

and

    \psi_2 - \alpha_2 < \varphi_2 < \pi + \psi_2 - \alpha_2    (k_3 > 0)    (27)

respectively. For purposes of illustration we consider α1 = α2 = ᾱ.
Then the regions defined by equations (26) and (27) may be plotted
in the φ2, ᾱ plane. From equation (25), if k1, k3 and k4 are > 0, k2 >
0 is automatically satisfied. Consider the constraint k4 > 0, which may
be written

    -\cos(\alpha_1 + \theta_{10}) [2 \sin(\alpha_1 + \theta_{10} + \alpha_2 + \theta_{20})
        + \sin(\alpha_1 + \theta_{10} - \alpha_2 - \theta_{20})] > 0.    (28)

We see that cos (α1 + θ10) = 0 is a critical condition, on either side
of which the term in the brackets must also change sign. Thus, the
lines

    \varphi_2 = \bar{\alpha} - \psi_1 \pm \pi/2    (k_4 = 0)    (29)

in the φ2, ᾱ plane are critical lines. Further, consider cos (α1 + θ10) >
0, then

    \sin(\psi_1 + \psi_2 - 2\bar{\alpha}) > -\sin(2\varphi_2 + \psi_1 - \psi_2)/2    (k_4 > 0).    (30)


Equation (30) represents a curved boundary in the φ2, ᾱ plane and
must be computed numerically. In Fig. 13 the regions bounded by
equations (26), (27), (29) and (30) are plotted. The data used for this
figure (ψ1 and ψ2) were taken from the Ge IMPATT example at a fre-
quency of 4 GHz from Fig. 9. Figure 13 shows that, for μ < 1, there
are two disjoint regions. Also indicated are the values of φ2 for which
θ10 = 0, π/2, π, 3π/2. The angle θ10 (Fig. 12) measures the relative
location of the diode’s actual input conductance with respect to the
single-frequency large-signal negative conductance, at the fundamental
frequency. For −π/2 < θ10 < π/2, cos θ10 is positive and the input
conductance is less negative than it would be for zero harmonic voltage.
For this range of θ10 then, the fundamental output power is degraded
by harmonic tuning. On the other hand, for π/2 < θ10 < 3π/2, the
input conductance is more negative than for V2 = 0, and the funda-
mental output power is enhanced by the presence of harmonic tuning.
These relationships can readily be seen by rewriting equation (1)

    \mathrm{Re}(Y_{in1}) = -g_1 + |y_{12}| \frac{V_2}{V_1} \cos\theta_{10}.

Indeed, θ10 = π maximizes the fundamental output power for the
particular values of V1, V2 being studied. We see that at 4 GHz, the
maximum fundamental power point exists within a stable region for
μ < 1. It is also interesting that the minimum fundamental power
phase (θ10 = 0) is in a separate region which requires a considerably
different circuit.

Fig. 13—Regions of stable φ2 versus ᾱ (α1 = α2 = ᾱ) in the strongly coupled
small-signal limit at 4 GHz; μ < 1.
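The enhancement and degradation ranges of θ10 can be made concrete with the 4-GHz entries of Table I. The sketch below assumes that the tabulated g11 is the magnitude g1 of the negative conductance and that |y12| = κ1 V1 as in equation (4); the function name `fundamental_conductance` is hypothetical.

```python
import math

def fundamental_conductance(g1, kappa1, v2, theta10):
    """Re(Yin1) = -g1 + |y12| (V2/V1) cos(theta10), with |y12| = kappa1 * V1."""
    return -g1 + kappa1 * v2 * math.cos(theta10)

# 4-GHz values from Table I (assumed mapping: g11 -> g1), V2 = 10 volts.
G1, KAPPA1, V2 = 12.7, 0.86, 10.0
degraded = fundamental_conductance(G1, KAPPA1, V2, 0.0)      # theta10 = 0
enhanced = fundamental_conductance(G1, KAPPA1, V2, math.pi)  # theta10 = pi
unchanged = fundamental_conductance(G1, KAPPA1, V2, math.pi / 2)
```

Under these assumptions the input conductance is −4.1 mhos/cm² at θ10 = 0 and −21.3 mhos/cm² at θ10 = π, compared with −12.7 mhos/cm² with no harmonic voltage, and θ10 = π/2 marks the crossover.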
Fig. 14—Regions of stable φ2 versus ᾱ (α1 = α2 = ᾱ) in the strongly coupled
small-signal limit at 4 GHz; μ > 1.

To obtain the similar diagram for μ > 1, the same considerations
can be reapplied to k1 through k4, or the subscripts on θ and α can be
interchanged. Either way, Fig. 14 shows the result. Comparison of
Figs. 13 and 14 shows indeed the disjointed, mutually exclusive be-
havior of the μ < 1 and μ > 1 regions of stability. Additionally, it
shows that for a given circuit (i.e., a given ᾱ), there are two stable
ranges of phase φ2 (if any at all) depending on the value of μ relative
to unity. One of these encompasses the θ10 = π maximum power phase
and the other encompasses the θ10 = 0 minimum power phase. A change
in the bias current, which does not alter significantly the circuit vari-
able ᾱ, may well change the relative value of μ from >1 to <1 or
vice versa, and such a change would necessitate a change of phase
to a different branch. Thus, which branch of the stability diagram
the oscillation state is in is determined by the history of tuning and
bias current changes. This type of behavior would be observed experi-
mentally as a hysteresis in frequency or power or both, which if analyzed
would indicate that the input admittance of the diode at the funda-
mental frequency is a nonunique function of the fundamental RF volt-
age. The presence of this effect would be indicated if one were able
to obtain two different values of power output for the same frequency
by changing the bias current only, without retuning the RF circuit
in any respect. Observation at a single frequency is required in order
to rule out the possibility of multiple-valued circuit admittances.”° 

In the next section, we compute the regions of stability for the 
germanium IMPATT example in full generality; that is, we use the 
complete form of the matrix B, equation (8). This must be done nu- 
merically so a limited number of cases can be examined, and the results 
are compared with the approximate forms of this section. 


IV. 6-GHz GERMANIUM OSCILLATOR EXAMPLE


Using Blue’s approximate large-signal analysis,’ the equivalent cir-
cuit parameters of Fig. 1 have been calculated for a germanium diode
of depletion layer width 4.75 microns with an assumed avalanche zone
width of 1.5 microns. This gives a critical field Ec = 1.87 × 10^5 V/cm
for a bias current density J0 = 340 A/cm², which agrees quite well
with the value obtained from a more exact numerical treatment. The
design of this model was an attempt to model the germanium diodes
reported by Swan’ and by Gewartowski and Morris.’ Because the
Read theory is slightly incorrect in its reactive effects, the frequency
of maximum negative conductance was at about 6 GHz for the model
but appeared to be at about 8 or 9 GHz for the actual diodes. In com-
paring the results of this work with those of the experiments, it there-
fore seems most useful to discuss frequency relative to fmax, at which
maximum output power is obtained. Thus, 4 GHz in this analytical
work is roughly equivalent to 6 GHz in Swan’s experiments. Table I
lists the large-signal information obtained from Figs. 2, 8 and 9 that is
needed for the solution of the stability constraints. This information
was obtained for V1 = 10 volts and V2 = 10 volts, and a dc bias current
density J0 = 340 A/cm².

TABLE I—DIODE LARGE-SIGNAL PARAMETERS AT V1 = V2 = 10 VOLTS

                               3 GHz     4 GHz      5 GHz     6 GHz
g11 (mhos/cm²)                 5.8       12.7       15.4      15.6
g22 (mhos/cm²)                 15.6      12.7       8.5       4.3
∂g11/∂V1 (mhos/cm²-volt)       0.0       0.22       0.24      0.21
∂b11/∂V1 (mhos/cm²-volt)       1.1       0.65       0.30      0.20
∂g22/∂V2 (mhos/cm²-volt)       0.21      0.125      0.065     0.035
∂b22/∂V2 (mhos/cm²-volt)       0.20      0.0        0.0       0.0
ψ1 (π radians)                 −0.2089   −0.38056   −0.395    −0.477
ψ2 (2π radians)                0.9031    0.7742     0.6181    0.4798
κ1 (mhos/cm²-volt)             1.25      0.86       0.59      0.42
κ2 (mhos/cm²-volt)             1.25      0.70       0.385     0.205

It is known that at resonance in a low-loss circuit where the real
part of the admittance is constant or nearly so, the external Q can be
written

    Q_{ext} = \frac{\omega_0}{2 G_0} \frac{dB}{d\omega}

where G0 is the real part of the admittance at ω0 and B is the suscep-
tance. Resonance is defined by the vanishing of B(ω0). It is useful
here to extend this definition to define the slope parameters

    D_1 = \frac{\omega_0}{2 G_{10}} \left| \frac{dY_1}{d\omega} \right|_{\omega = \omega_0}

at the fundamental frequency and

    D_2 = \frac{\omega_0}{G_{20}} \left| \frac{dY_2}{d\omega} \right|_{\omega = 2\omega_0}

at the second harmonic. If, at ω = ω0 and ω = 2ω0, G′10 and G′20 vanish
respectively, then D1 and D2 reduce to the external Q’s of the circuit
at these two frequencies, particularly since the major portion of the
diode’s susceptance is considered to be part of the external circuit.

Since, at an equilibrium point, from equations (44) and (46) of the
Appendix

    G_{10} = g_1 - \kappa_1 V_2 \cos\theta_{10}
and
    G_{20} = g_2 - \kappa_2 \frac{V_1^2}{V_2} \cos\theta_{20},

specification of the parameters D1 and D2 permits the calculation of
|Y1′| and |Y2′| from the information of Table I.
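As a hedged illustration of how the slope parameters fix |Y1′| and |Y2′|, the sketch below inverts the definitions of D1 and D2 after computing the equilibrium conductances. It assumes that the tabulated g11 and g22 are g1 and g2, that the second-harmonic coupling term carries the factor V1²/V2 as in equation (41), and that the phases θ10 and θ20 are free inputs; the function name `circuit_slopes` and the chosen phases are hypothetical.

```python
import math

def circuit_slopes(g1, g2, kappa1, kappa2, v1, v2, theta10, theta20,
                   d1, d2, omega0):
    """Return (G10, G20, |Y1'|, |Y2'|) from the equilibrium relations and D1, D2."""
    g10 = g1 - kappa1 * v2 * math.cos(theta10)
    g20 = g2 - kappa2 * (v1 ** 2 / v2) * math.cos(theta20)
    y1_slope = 2.0 * g10 * d1 / omega0  # from D1 = (w0 / 2 G10) |dY1/dw|
    y2_slope = g20 * d2 / omega0        # from D2 = (w0 / G20) |dY2/dw|
    return g10, g20, y1_slope, y2_slope

# 4-GHz entries of Table I with V1 = V2 = 10 volts; D1 = 50, D2 = 500.
OMEGA0 = 2.0 * math.pi * 4.0e9
g10, g20, y1_slope, y2_slope = circuit_slopes(
    12.7, 12.7, 0.86, 0.70, 10.0, 10.0, math.pi, math.pi / 2, 50.0, 500.0, OMEGA0
)
realizable = g10 > 0.0 and g20 > 0.0  # realizability: positive real parts
```

For the assumed phases the required circuit conductances are G10 = 21.3 and G20 ≈ 12.7 mhos/cm², both positive, so this particular operating point would at least satisfy the realizability criterion.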
The general stability criteria for the matrix B are as follows: Let
B be represented

    B = \begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix}.

The condition that the eigenvalues of B all have positive real parts
implies that

    k_1 = a + e + i > 0,
    k_2 = ae + ei + ai - bd - fh - gc > 0,
    k_3 = \det B > 0
and
    k_4 = k_1 k_2 - k_3 > 0.    (31)
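The conditions (31) are the Routh-Hurwitz conditions for the characteristic polynomial λ³ − k1λ² + k2λ − k3 of a 3 × 3 matrix, and they can be checked without computing eigenvalues. The sketch below is a minimal implementation; the function names and the test matrices are hypothetical and are unrelated to the diode data.

```python
def stability_constants(b):
    """Return (k1, k2, k3, k4) of equation (31) for a 3x3 matrix b."""
    (a, bb, c), (d, e, f), (g, h, i) = b
    k1 = a + e + i                                        # trace
    k2 = a * e + e * i + a * i - bb * d - f * h - g * c   # sum of principal minors
    k3 = a * (e * i - f * h) - bb * (d * i - f * g) + c * (d * h - e * g)  # det
    k4 = k1 * k2 - k3
    return k1, k2, k3, k4

def is_stable(b):
    """True when all four constants of equation (31) are positive."""
    return all(k > 0 for k in stability_constants(b))

# Eigenvalues 1, 2, 3: stable.
STABLE = [[1.0, 0.5, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
# Eigenvalues -0.1 +/- j and 1: trace and determinant are positive, but the
# complex pair has a negative real part, so k4 is negative.
OSCILLATORY = [[-0.1, 1.0, 0.0], [-1.0, -0.1, 0.0], [0.0, 0.0, 1.0]]
```

The second matrix shows why k1 > 0 and k3 > 0 alone are not sufficient: a growing oscillatory mode is detected only by the sign of k4.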


These conditions must be checked numerically, and the number of
independent variables for a general study is quite large. In the calcula-
tions done here, the circuit variables have been restricted to α1 = α2 =
ᾱ, with two sets of slope parameters: (i) D1 = 50, D2 = 500 and (ii)
D1 = 50, D2 = 10. The restriction on α1 and α2 is quite artificial but
allows comparison with the approximately determined regions of Section
III. The two sets of slope parameters D1, D2 are an attempt to model
(i) a high Q and (ii) a low Q second-harmonic circuit, respectively,
and to thereby approximate the two conditions μ ≪ 1 and μ > 1 for
the same set of diode data.

The results of these calculations are shown in Figs. 15 and 16 for
the frequencies 3, 4, 5 and 6 GHz. These show the values of stable
second-harmonic phase φ2 as functions of the circuit angles, α1 = α2 =
ᾱ. These regions repeat themselves with a periodicity of 2π in both
φ2 and ᾱ. Only the principal branches are shown but it should be under-
stood that wherever one of these regions extends across the boundaries
chosen, it should be reflected back into the region at the opposite
boundary. Figure 15 is for the case D1 = 50, D2 = 500, and corresponds
to a value of μ < 0.4 everywhere. Figure 16, for which D1 = 50, D2 =
10, corresponds to values of μ from near or slightly less than unity, to
greater than 4 to 8 (the only exception is in Fig. 16a where one region
appears having a value of μ ≈ 0.02). It should be noted that the value
of μ = 1 is no longer a critical value, inasmuch as stable states may
now exist for which μ = 1. They do not appear to be large in number,
however, and one may think of μ = 1 as a transition value for which
the area of the stable regions in the φ2, ᾱ plane becomes small.
Fig. 15—Large-signal regions of stable φ2 versus ᾱ (α1 = α2 = ᾱ) as obtained
from the eigenvalues of the complete B matrix for the germanium oscillator example
at (a) 3 GHz, (b) 4 GHz, (c) 5 GHz, and (d) 6 GHz; circuit variables D1 = 50,
D2 = 500; diode variables V1 = V2 = 10 volts, J0 = 340 A/cm². This figure has
μ < 1 everywhere.


Consider the 4-GHz results and compare Figs. 15b and 16b with
Figs. 13 and 14. The locations of the stable regions in the φ2, ᾱ plane
show a one-to-one correspondence but with greatly distorted shapes.
It therefore appears that the strongly coupled small-signal approxima-
tion used in Figs. 13 and 14, together with the μ < 1 and μ > 1 cases,
does give useful information about the general location of these stable
regions for more realistic cases. The general properties of disjointedness
and mutual exclusiveness are no longer strictly true (for example,
there is some overlap of the regions centered at ᾱ = π in Figs. 15d
and 16d). However, it is easy to see that tuning discontinuities may
still occur, and that the circuit angles must be considerably different
to obtain oscillation at θ10 = π, for example, for the two different sets
of values of slope parameters considered.

It is interesting that the angles α1 and α2 (and therefore, ᾱ) are
equal to π/2 for simple shunt resonant circuits at both ω0 and 2ω0, and
that the stability diagrams show no cases of stable operation for this
condition. Because of the approximations of this analysis, this cannot
be construed to be a general conclusion, even for the diode modeled.
It does show, however, that such conditions may arise and that obtaining
just the correct phase relations for maximum output power with a
given circuit may be extremely difficult.





Fig. 16—Large-signal regions of stable φ2 versus ᾱ (α1 = α2 = ᾱ) as obtained
from the eigenvalues of the complete B matrix for the germanium oscillator example
at (a) 3 GHz, (b) 4 GHz, (c) 5 GHz, and (d) 6 GHz; circuit variables D1 = 50,
D2 = 10; diode variables V1 = V2 = 10 volts, J0 = 340 A/cm². This figure has
μ > 1 everywhere except as noted. (Horizontal axes: ᾱ/π from 0 to 2.0.)

Another observation is that the angle θ10 = π for maximum funda-
mental power output does have a stable realization in almost every
case examined, even with the restriction α1 = α2.

If the points of operation along the circuit admittance curves Y1(ω0),
Y2(2ω0) are near minima of their real parts, the angles α1 and α2 are
restricted to lie in the range 0 < αj < π, j = 1, 2. Such a limitation
seems to imply different possibilities at the four frequencies calculated.
At 3 GHz, stability is obtained in the neighborhood of θ10 = π and
only for the D2 = 500 case (μ < 1). At 4 GHz, stability near θ10 = π
is only obtained for the D2 = 500 (μ < 1) case, but there are additional
stable states at or near θ10 = 0 for both the D2 = 500 (μ < 1) and D2
= 10 (μ > 1) cases. Also, at 4 GHz, Fig. 16b shows a region which
encompasses the θ10 = π/2 point which is a crossover between enhanced
and degraded fundamental power. The 5-GHz cases are very similar
to those at 4 GHz except that there are more enhanced-power stable
states for the D2 = 10 (μ > 1) case than at the lower frequencies.
At 6 GHz, this shift is more advanced with roughly an equal number
of stable states in the enhanced power region for the D2 = 10 (μ > 1)
and D2 = 500 (μ < 1) cases.


V. SUMMARY AND CONCLUSIONS 


An analysis of the stability of the tuned-harmonic mode in IMPATT 
oscillators has been presented using a simplified model of the frequency 
conversion in the avalanche diode. It has been shown that the stability 
constraints are generally quite restrictive and difficult to satisfy, par- 
ticularly for diodes showing strong harmonic interactions. The goal 
of this work has not been to present a set of design curves which insure 
stable tuned-harmonic operation, but rather to consider the difficulties 
which the stability constraints present. 

When the circuit restricts the voltage across the diode to be largely
sinusoidal, this analysis reduces to that of the stability of a “single-
frequency” oscillator. For nonzero fundamental and second-harmonic
voltages V1 and V2, a characteristic parameter μ has been defined
[equation (18)] which is dependent upon both diode and circuit char-
acteristics and degree of excitation. The value of μ = 1 appears to be
somewhat critical in that the stable regions for μ > 1 and μ < 1 are
usually separate. Any tuning or bias changes which force μ to pass
through unity are very likely to produce sudden changes in the output
variables, i.e., power and frequency. For example, the single-frequency
oscillator is destined to have μ > 1 because of the small value of V2.
However, for equal V1 and V2 and D2/D1 ≈ 10, μ < 0.4. Thus the
single-frequency oscillator and the tuned-harmonic oscillator (high Q,
2ω0 circuit) are likely to operate in different regions of stability.

The numerical treatment of the stability criteria has been restricted
to the case where the circuit angles α1 and α2 are equal. Thus the
results presented here cannot be considered complete. However, in the
example studied, it was found that at an operating frequency two-
thirds the frequency of maximum output power, the phase φ2 for maxi-
mum power is indeed stable and also corresponds to a realizable cir-
cuit. It was also found that it is possible to degrade the output power,
and therefore, harmonic interactions when improperly adjusted can
severely lower a diode’s output power from that which would exist
with no harmonic voltage at all.

As a necessary part of this instability analysis, a two-port model 
for the interaction was introduced and characterized for the 6-GHz 
germanium IMPATT model presented. This characterization illustrates 
the role of the second harmonic in introducing a “pseudo-pole” into 
the nonlinear admittance of the fundamental, and it clarifies the rele- 
vance of the single-frequency admittance plane characterization for 
the tuned-harmonic mode of operation. 

This analysis also has assumed that y12 and y21 may be described
by equation (4). If, on the other hand, y12 and y21 are assumed constant,
then this analysis becomes identical with that of two nonlinear oscillators
coupled through a linear circuit. That analysis can be carried
through in the same manner as presented here. In such a case, the 
weakly coupled case becomes of considerable interest and has been 
treated by Schlosser.” 

It is not necessary, of course, to introduce the two-port model of 
Fig. 1 at all, with its attendant assumptions and approximations, but 
it is possible to consider the perturbation of the oscillation-state di- 
rectly from the numerical solution of the IMPATT equations. This 
would be a more accurate method to pursue; however, it is felt that 
the approach presented in this paper provides insight that might be 
obscured in a more complicated approach. 
APPENDIX A


Derivation of the Stability Matrix 


In this appendix, the stability of the oscillation-state is considered 
using a linearized perturbation treatment about any general large- 
signal operating state. The result of this appendix is the derivation of 
the state-equation (6) and the stability matrix B, equation (8). 

Consider a prescribed state of oscillation satisfying the two condi- 
tions 


Y ,(w) ae Yin(Vi , Ve »F1 » G2) = 0 (32) 
and 
Y 2(2wo) ae Y snl Vi ) V2 »F1 » £2) = 0, (33) 


where $Y_1(\omega_0)$ and $Y_2(2\omega_0)$ are the circuit admittances at $\omega_0$ and $2\omega_0$ 
respectively. An approximation is made that the input admittances 
of the diode, $Y_{in1}$ and $Y_{in2}$, are slowly varying functions of frequency 
as compared with the circuit admittances $Y_1(\omega_0)$ and $Y_2(2\omega_0)$. This is 
facilitated by considering the depletion layer capacitance, for example, 
to be a part of the external circuit. Generally speaking, equations 
(32) and (33) prescribe a functional dependence of $\omega$, the frequency 
of oscillation, upon the voltage amplitudes and phases for small varia- 
tions. For small variations in $\omega$ we can approximate 

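$$Y_1(\omega_0 + \delta_1) \approx Y_1(\omega_0) + \left.\frac{dY_1}{d\omega}\right|_{\omega_0} \delta_1$$
and 
$$Y_2(2\omega_0 + \delta_2) \approx Y_2(2\omega_0) + \left.\frac{dY_2}{d\omega}\right|_{2\omega_0} \delta_2 .$$
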

The $\delta_i$ can be determined by allowing the voltage amplitudes and 
phases to be slowly varying functions of time 

$$v_1(t) = a_1(t) \cos[\omega_0 t + \varphi_1(t)] \tag{34}$$
and 
$$v_2(t) = a_2(t) \cos[2\omega_0 t + \varphi_2(t)]. \tag{35}$$


Differentiating with respect to time gives 


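$$\frac{dv_1}{dt} = \mathrm{Re}\left\{ j\left[\omega_0 + \frac{d\varphi_1}{dt} - j\frac{1}{a_1}\frac{da_1}{dt}\right] a_1 \exp[j(\omega_0 t + \varphi_1)] \right\} \tag{36}$$
and 
$$\frac{dv_2}{dt} = \mathrm{Re}\left\{ j\left[2\omega_0 + \frac{d\varphi_2}{dt} - j\frac{1}{a_2}\frac{da_2}{dt}\right] a_2 \exp[j(2\omega_0 t + \varphi_2)] \right\}. \tag{37}$$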
Thus, we can identify’® 
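$$\delta_1 = \frac{d\varphi_1}{dt} - j\frac{1}{a_1}\frac{da_1}{dt}$$
and 
$$\delta_2 = \frac{d\varphi_2}{dt} - j\frac{1}{a_2}\frac{da_2}{dt},$$
and therefore 
$$Y_1(\omega_0 + \delta_1) \approx Y_1(\omega_0) + \frac{dY_1}{d\omega}\left(\frac{d\varphi_1}{dt} - j\frac{1}{a_1}\frac{da_1}{dt}\right) \tag{38}$$
and 
$$Y_2(2\omega_0 + \delta_2) \approx Y_2(2\omega_0) + \frac{dY_2}{d\omega}\left(\frac{d\varphi_2}{dt} - j\frac{1}{a_2}\frac{da_2}{dt}\right) \tag{39}$$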
are the circuit admittances related to slow variations of the amplitudes 
and phases. 

From the equivalent circuit of Fig. 1, the currents at the fundamental 
and second harmonic are 


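$$i_1(t) = \mathrm{Re}\{[y_{11} a_1 \exp(j\varphi_1) + y_{12} a_2 \exp(j\varphi_2)] \exp(j\omega_0 t)\}$$
and 
$$i_2(t) = \mathrm{Re}\{[y_{21} a_1 \exp(j\varphi_1) + y_{22} a_2 \exp(j\varphi_2)] \exp(j2\omega_0 t)\},$$
which may be rewritten using the assumptions (4) as 
$$i_1(t) = [-g_1 a_1 + \kappa_1 a_1 a_2 \cos(2\varphi_1 - \varphi_2 - \psi_1)] \cos(\omega_0 t + \varphi_1) + [-b_1 a_1 + \kappa_1 a_1 a_2 \sin(2\varphi_1 - \varphi_2 - \psi_1)] \sin(\omega_0 t + \varphi_1) \tag{40}$$
and 
$$i_2(t) = [-g_2 a_2 + \kappa_2 a_1^2 \cos(\varphi_2 - 2\varphi_1 - \psi_2)] \cos(2\omega_0 t + \varphi_2) + [-b_2 a_2 + \kappa_2 a_1^2 \sin(\varphi_2 - 2\varphi_1 - \psi_2)] \sin(2\omega_0 t + \varphi_2). \tag{41}$$
Here we have introduced 

$$y_{11} = -g_1 + j b_1$$
and 
$$y_{22} = -g_2 + j b_2 .$$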
Kirchhoff's laws for the nonequilibrium case are 

$$i_1(t) + \mathrm{Re}\{Y_1(\omega_1) a_1 \exp(j\varphi_1) \exp(j\omega_0 t)\} = 0 \tag{42}$$
and 
$$i_2(t) + \mathrm{Re}\{Y_2(\omega_2) a_2 \exp(j\varphi_2) \exp(j2\omega_0 t)\} = 0, \tag{43}$$

where $\omega_1$ and $\omega_2$ are the perturbed fundamental and second-harmonic 
frequencies. 

Equations (40) and (41) with (42) and (43) give the following four 
differential equations for the quantities $a_1(t)$, $a_2(t)$, $\varphi_1(t)$ and $\varphi_2(t)$ 


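$$G_1 - g_1 + G_1' \frac{d\varphi_1}{dt} + B_1' \frac{1}{a_1}\frac{da_1}{dt} = -\kappa_1 a_2 \cos\theta_1 , \tag{44}$$
$$-(B_1 + b_1) - B_1' \frac{d\varphi_1}{dt} + G_1' \frac{1}{a_1}\frac{da_1}{dt} = -\kappa_1 a_2 \sin\theta_1 , \tag{45}$$
$$G_2 - g_2 + G_2' \frac{d\varphi_2}{dt} + B_2' \frac{1}{a_2}\frac{da_2}{dt} = -\kappa_2 \frac{a_1^2}{a_2} \cos\theta_2 , \tag{46}$$
$$-(B_2 + b_2) - B_2' \frac{d\varphi_2}{dt} + G_2' \frac{1}{a_2}\frac{da_2}{dt} = -\kappa_2 \frac{a_1^2}{a_2} \sin\theta_2 . \tag{47}$$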


Here we have defined $Y_1 = G_1 + jB_1$, $Y_2 = G_2 + jB_2$ and the primes 
denote differentiation with respect to $\omega$. Also 

$$\theta_1 = 2\varphi_1 - \varphi_2 - \psi_1 \tag{48}$$
and 
$$\theta_2 = \varphi_2 - 2\varphi_1 - \psi_2 . \tag{49}$$


Equations (44) through (47) may be rewritten so as to contain only 
a single time derivative in each 


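$$B_1'(G_1 - g_1) - G_1'(B_1 + b_1) + |Y_1'|^2 \frac{1}{a_1}\frac{da_1}{dt} = -\kappa_1 a_2 [B_1' \cos\theta_1 + G_1' \sin\theta_1], \tag{50}$$

$$G_1'(G_1 - g_1) + B_1'(B_1 + b_1) + |Y_1'|^2 \frac{d\varphi_1}{dt} = -\kappa_1 a_2 [G_1' \cos\theta_1 - B_1' \sin\theta_1], \tag{51}$$

$$B_2'(G_2 - g_2) - G_2'(B_2 + b_2) + |Y_2'|^2 \frac{1}{a_2}\frac{da_2}{dt} = -\kappa_2 \frac{a_1^2}{a_2} [B_2' \cos\theta_2 + G_2' \sin\theta_2], \tag{52}$$

$$G_2'(G_2 - g_2) + B_2'(B_2 + b_2) + |Y_2'|^2 \frac{d\varphi_2}{dt} = -\kappa_2 \frac{a_1^2}{a_2} [G_2' \cos\theta_2 - B_2' \sin\theta_2]. \tag{53}$$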


Since $\varphi_1$ is an arbitrary quantity with no physical significance, it can 
be eliminated in favor of the difference phase $\varphi_2 - 2\varphi_1$ since this appears 
in both $\theta_1$ and $\theta_2$. This is done by multiplying equation (51) by $2/|Y_1'|^2$, 
equation (53) by $1/|Y_2'|^2$ and subtracting equation (51) from (53), 
giving 

$$\frac{d}{dt}(\varphi_2 - 2\varphi_1) + \frac{G_2'(G_2 - g_2) + B_2'(B_2 + b_2)}{|Y_2'|^2} - 2\,\frac{G_1'(G_1 - g_1) + B_1'(B_1 + b_1)}{|Y_1'|^2} = -\kappa_2 \frac{a_1^2}{a_2}\,\frac{G_2' \cos\theta_2 - B_2' \sin\theta_2}{|Y_2'|^2} + 2\kappa_1 a_2\,\frac{G_1' \cos\theta_1 - B_1' \sin\theta_1}{|Y_1'|^2}. \tag{54}$$
Equations (50), (52) and (54) form the set of differential equations 
for $a_1(t)$, $a_2(t)$ and $\varphi_2(t) - 2\varphi_1(t)$ which will be linearized for small 
perturbations around the oscillation state. These perturbations take 
the form 

$$a_1 = V_1 + \delta a_1 ,$$
$$a_2 = V_2 + \delta a_2 ,$$
and 
$$\varphi_2 - 2\varphi_1 = \varphi_{20} - 2\varphi_{10} + \delta(\varphi_2 - 2\varphi_1),$$

where $V_1$, $V_2$, $\varphi_{10}$ and $\varphi_{20}$ are the unperturbed values of $a_1(t)$, $a_2(t)$, 
$\varphi_1(t)$ and $\varphi_2(t)$. The perturbations in the voltage amplitudes will change 
$g_1$, $b_1$, $g_2$, $b_2$ away from their values $\bar{g}_1$, $\bar{b}_1$, $\bar{g}_2$, $\bar{b}_2$ which correspond to 
$\delta a_1 = \delta a_2 = \delta(\varphi_2 - 2\varphi_1) = 0$. Thus, we define the saturation parameters 
$s$, $r$, $u$, $v$ which describe the linearized variation of $g_1$ around $\bar{g}_1$, etc., 
by the equations (see Fig. 10) 


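$$s = \frac{V_1}{G_{10}}\,\frac{\delta(G_{10} - g_1)}{\delta a_1}, \tag{55}$$
$$r = \frac{V_1}{G_{10}}\,\frac{\delta(B_{10} + b_1)}{\delta a_1}, \tag{56}$$
$$u = \frac{V_2}{G_{20}}\,\frac{\delta(G_{20} - g_2)}{\delta a_2}, \tag{57}$$
and 
$$v = \frac{V_2}{G_{20}}\,\frac{\delta(B_{20} + b_2)}{\delta a_2}, \tag{58}$$

where the zero subscript on the circuit variables denotes their evalua- 
tion at $\omega_0$ or $2\omega_0$ as appropriate. 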

Equations (50), (52) and (54) may now be cast in a simple matrix 
form 


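$$\frac{d\varepsilon}{dt} + B\varepsilon = 0 \tag{59}$$
where the vector $\varepsilon$ is defined as 
$$\varepsilon = \begin{bmatrix} \delta a_1 / V_1 \\ \delta a_2 / V_2 \\ \delta(\varphi_2 - 2\varphi_1) \end{bmatrix} \tag{60}$$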


and the matrix B is given by equation (8) of Section III. Equation (59) 
indicates that the perturbations decay with time, giving a stable state 
of oscillation, if the eigenvalues of the matrix B are all positive. 
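
As an illustration of this stability test, the following minimal Python sketch checks whether every eigenvalue of a real 3 x 3 matrix has a positive real part, which is the condition under which solutions of equation (59) decay. It uses the Routh-Hurwitz conditions on the characteristic polynomial rather than computing eigenvalues. The function name `is_stable` and the two sample matrices are assumptions made for the example; they are not the matrix B of equation (8).

```python
def is_stable(B):
    """True if all eigenvalues of the real 3x3 matrix B have positive real parts."""
    (a, b, c), (d, e, f), (g, h, i) = B
    trace = a + e + i
    # Sum of the three principal 2x2 minors.
    minors = (a * e - b * d) + (a * i - c * g) + (e * i - f * h)
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    # Eigenvalues x of B satisfy x^3 - trace*x^2 + minors*x - det = 0.
    # With x = -s this becomes s^3 + trace*s^2 + minors*s + det = 0, and the
    # Routh-Hurwitz test for Re(s) < 0 gives the three conditions below.
    return trace > 0 and det > 0 and trace * minors > det


# Hypothetical matrices, chosen only to exercise the test.
B_stable = [[1.0, -5.0, 0.0], [5.0, 1.0, 0.0], [0.0, 0.0, 2.0]]      # eigenvalues 1 +/- 5j, 2
B_unstable = [[-1.0, -5.0, 0.0], [5.0, -1.0, 0.0], [0.0, 0.0, 2.0]]  # eigenvalues -1 +/- 5j, 2
print(is_stable(B_stable), is_stable(B_unstable))
```

Because B is in general not symmetric, its eigenvalues may be complex, and the test is on their real parts.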


APPENDIX B 


List of Symbols 


$a_1, a_2$  Slowly varying amplitudes of the fundamental and second- 
harmonic voltages; equations (34) and (35). 

$B$  Stability matrix; equation (8). 

$B_1, B_2$  Fundamental and second-harmonic external circuit suscep- 
tances; following equation (47). 

$b_1, b_2$  Imaginary parts of $y_{11}$ and $y_{22}$, the susceptances of the 
single-frequency oscillator admittances; following equa- 
tion (41). 

$D_1, D_2$  Fundamental and second-harmonic external circuit slope 
parameters; Section IV. 

$G_1, G_2$  Fundamental and second-harmonic external circuit con- 
ductances; following equation (47). 

$g_1, g_2$  Negative of the conductances of the single-frequency os- 
cillator admittances; following equation (41). 

$K_1, K_2$  Complex normalized form of $y_{12}$ and $y_{21}$; equation (4). 

$s, r$  Saturation parameters for the admittance $y_{11}$; equations 
(55) and (56). 

$u, v$  Saturation parameters for the admittance $y_{22}$; equations 
(57) and (58). 

$V_1, V_2$  Fundamental and second-harmonic voltage amplitudes; 
preceding equation (1). 

$Y_1, Y_2$  Fundamental and second-harmonic external circuit admit- 
tances; Fig. 1. 

$Y_{in1}, Y_{in2}$  Fundamental and second-harmonic IMPATT diode input 
admittances; equations (1) and (2) and Fig. 1. 

$y_{11}, y_{22}$  Fundamental and second-harmonic “single-frequency” os- 
cillator admittances; Fig. 1. 

$y_{12}, y_{21}$  Conversion transfer admittances between fundamental and 
second harmonic; Fig. 1. 

Bio 4 Hoi  Approximate form of $y_{12}$ and $y_{21}$; equation (4). 

$\alpha_1, \alpha_2$  Fundamental and second-harmonic circuit admittance 
slope angles; Fig. 12. 

11,2  Fundamental and second-harmonic single-frequency diode 
admittance slope angles; Fig. 12. 

$\theta_1, \theta_2$  Phase variables; equations (48) and (49). 

Gin One 6, and 6, for g, = 0, equation (13). 

$\kappa_1, \kappa_2$  Magnitudes of $K_1$ and $K_2$; equation (4). 

a  Stability parameter, equation (18). 

$\varphi_1, \varphi_2$  Fundamental and second-harmonic voltage phases; pre- 
ceding equation (1). 

$\psi_1, \psi_2$  Arguments of $K_1$ and $K_2$; equation (4). 

$\omega_0$  Fundamental radian frequency. 
An Analysis of Adaptive Retransmission 
Arrays in a Fading Environment 


By Y. S. YEH 
(Manuscript received December 3, 1969) 


We analyze in this paper the performance of adaptive retransmission for 
improving two-way communication between antenna arrays in a randomly 
fading environment. 

For a stationary environment, S. P. Morgan has shown that complex 
conjugate retransmission reaches a stable state and maximizes the signal- 
to-noise ratio of a maximal ratio diversity reception system. We show that 
a simpler system using phase conjugate retransmission will also stabilize 
and maximize the signal-to-noise ratio of an equal gain diversity reception 
system. 

Where the fading is slow in comparison to the system settling-down time, 
both systems provide a significant improvement in transmission. 

Subject to Rayleigh fading, we have obtained the average signal strength 
and its cumulative probability distribution for various combinations of 
numbers of antennas in the two arrays for each of the above-mentioned 
systems. This information is useful in choosing an optimal division of 
diversity branches for the two antenna arrays. It is further observed that 
although the phase conjugate retransmission system is much simpler to 
implement, its performance is only slightly inferior to the corresponding 
complex conjugate system. 


I. INTRODUCTION 


Adaptive antenna arrays have been the subject of numerous in- 
vestigations.1* In an adaptive transmitting array, the individual 
element is excited according to information derived from the incident 
pilot field. For example, in a complex conjugate system, the excitation 
currents are proportional to the complex conjugate of the incident 
voltages while the total power radiated is kept constant. In a phase 
conjugate system, the currents are kept constant while the phases 
are adjusted according to the conjugate phase of the incident voltages. 


In a free-space environment, that is, plane wave incident from 
a particular direction, it is well known that phase reversal would 
steer the radiated beam toward the source antenna. Cutler and others? 
have shown how phase reversal can be achieved by frequency con- 
version of the pilot signal. 

The role of adaptive retransmission in a multipath fading environ- 
ment, for example, mobile radio, troposcatter communication, and so on, 
has received far less attention. Still unanswered is the question of 
whether the phase conjugate or the complex conjugate retransmission 
schemes could improve the communication link and reach a stable 
state. In his work, S. P. Morgan has shown that, in a stationary 
arbitrary environment, stable state and maximal power transfer can 
be achieved by complex conjugate retransmission.® 

In this paper, we show that the much simpler phase conjugate 
system will also reach a stable state. Furthermore, assuming equal 
amplitude transmitting currents on the antenna elements, the sum- 
mation of voltages received at one array is equal to that of the other 
array and is maximized. Consequently, the phase conjugate retrans- 
mission system will maximize the signal-to-noise ratio (S/N) of an 
equal gain diversity reception system.t 

In general, the fundamental differences of the two retransmission 
schemes are that the phase conjugate retransmission maximizes the 
sum of the amplitudes of the voltages received and the complex 
conjugate retransmission maximizes the total power received. 

Where fading is slow in comparison to the time required to reach 
an equilibrium state, both systems could be used to improve the 
quality of a fading communication link. 

We investigate the performance of these two systems in actual 
fading environments. In particular, we want to know how these two 
systems differ in average S/N, what the S/N probability distributions 
are, how much they improve fading statistics over a single branch 
system and, finally, what the optimal division of number of antennas 
would be between the two antenna arrays. 

In order to answer these questions, we must first establish the 
characteristics of the medium which links the two antenna arrays. 
For example, in mobile radio the signal received by a single antenna 
is rapidly varying and can be characterized by Rayleigh statistics over 
distances of a few hundred wavelengths.’ However, over an extended 
range of observations, other large-scale phenomena such as distance 
variations, shadowing, and channeling by streets will produce slow 
variations of the average signal strength received. The adaptive 
retransmission system per se can reduce the rapid fluctuations but will 
be of little help in reducing those long-term variations. Consequently, 
the comparison of the performance of adaptive retransmission ar- 
rays will be based on their relative effectiveness in reducing the 
rapid Rayleigh fading. 

The Rayleigh fading is also an excellent approximation in other 
communication systems such as long-range UHF and SHF tropospheric 
transmission,* and so on. Furthermore, results obtained from Ray- 
leigh fading can give significant insight into the performance of 
adaptive antenna arrays under other fading conditions. 

Based on Rayleigh fading statistics, we investigated the cumulative 
probability distribution (CPD) of the signal strength of an m:n array 
system. By m:n we mean that there are m antennas at station 1 and 
n antennas at station 2. The analysis is done by the Monte Carlo 
method on a digital computer. The 99 percent reliability level* as 
well as the average signal strength for a unity transmitter power are 
obtained. It is interesting to note that with the help of interpolation, 
in most cases, only 96 computer samples are sufficient to yield a 
CPD which is accurate up to a few tenths of a dB for all the infor- 
mation we need. 

The average S/N of the two retransmission schemes are compared. 
It is observed that although the phase conjugate system is much 
simpler to build, it is only slightly inferior to the complex conjugate 
retransmission system. 

For other types of fading distributions, the techniques described 
here can readily be applied. 


II. ANALYSIS OF THE PHASE CONJUGATE RETRANSMISSION 


The configuration of the arrays is depicted in Fig. 1. The open cir- 
cuit voltages and the transmitting currents in each array are rep- 
resented by column vectors with the time factor exp (jwt) suppressed. 
The mutual couplings are neglected and the antennas in each array 
are assumed to be identical, with input resistance R during trans- 
mission and admittance G during reception. 

The transmitting current vector $I_2$ at array 2 produces the received 
voltage vector at array 1, 

$$V_1 = C \Gamma I_2 \tag{1}$$
where $\Gamma$ is an m × n matrix whose elements are proportional to the 


* The 99 percent reliability level is defined such that for 99 percent of the time 
the signal strength is above this level. 


1814 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1970 


Fig. 1—Arrays in adaptive retransmission system (array 1, m antennas; array 2, n antennas). 


transmission between a particular pair of antennas. The real constant 
C stands for the average transmission loss. 
By reciprocity, the received voltage at array 2 is 
$$V_2 = C \Gamma^t I_1 \tag{2}$$
where the superscript t stands for the transpose of the $\Gamma$ matrix. 
Here according to our definition of phase conjugate retransmission, 
the elements of $I_1$ and $I_2$ are of unity amplitude although their phases 
could be different. Multiplying equations (1) and (2) by $I_1$ and $I_2$, 
respectively, we obtain the following 
$$(V_1 , I_1) = C(\Gamma I_2 , I_1) \tag{3}$$
$$(V_2 , I_2) = C(\Gamma^t I_1 , I_2) \tag{4}$$
where the brackets ( ) stand for inner product. Equations (3) and 
(4) are equal, and we obtain the following reciprocity relation 

$$(V_1 , I_1) = (V_2 , I_2). \tag{5}$$
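
The reciprocity relation (5) can be checked numerically. The sketch below is only an illustration: it assumes the inner product is the plain sum of element products without conjugation (as equation (6) suggests), takes C = 1, and draws an arbitrary random 3 x 4 matrix and unit-amplitude currents. None of these values come from the paper.

```python
import cmath
import random

rng = random.Random(0)
m, n, C = 3, 4, 1.0  # array sizes and loss constant (illustrative values)

# Arbitrary complex transmission matrix (m x n) and unit-amplitude currents.
gamma = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)] for _ in range(m)]
I1 = [cmath.exp(1j * rng.uniform(0, 2 * cmath.pi)) for _ in range(m)]
I2 = [cmath.exp(1j * rng.uniform(0, 2 * cmath.pi)) for _ in range(n)]

# Equation (1): V1 = C * Gamma * I2.  Equation (2): V2 = C * Gamma^t * I1.
V1 = [C * sum(gamma[i][j] * I2[j] for j in range(n)) for i in range(m)]
V2 = [C * sum(gamma[i][j] * I1[i] for i in range(m)) for j in range(n)]

# Inner products taken as sums of products, with no complex conjugation.
lhs = sum(v * c for v, c in zip(V1, I1))
rhs = sum(v * c for v, c in zip(V2, I2))
print(abs(lhs - rhs))  # zero up to rounding error
```

Both sides reduce to the same double sum over the elements of the matrix times the two currents, which is why the equality holds for any choice of currents.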


2.1 Stabilization of the Phase Conjugate Retransmission System 


Let array 1 be excited initially with current $I_1$ which produces $V_2$ at 
array 2. And let array 2 be excited with $I_2$ which produces $V_1$ at array 1. 
Equation (5) holds and we have the following 

$$\sum_{i=1}^{m} V_{1i} I_{1i} = \sum_{i=1}^{n} V_{2i} I_{2i} \tag{6}$$

where the subscript i stands for the ith element of the array. 
Consider now the excitation at array 2. Since the $I_{2i}$'s are of unity 
amplitude, the quantity $\sum_{i=1}^{n} V_{2i} I_{2i}$ can be maximized by choosing 
$I'_{2i}$ to be phase conjugate to $V_{2i}$. We shall call this real maximum quan- 
tity $\lambda$. Let $V'_1$ be the voltage vector produced by $I'_{2i}$; then we have 

$$\sum_{i=1}^{m} V'_{1i} I_{1i} = \sum_{i=1}^{n} V_{2i} I'_{2i} = \sum_{i=1}^{n} |V_{2i}| = \lambda. \tag{7}$$

Let us now consider the excitation of array 1. Obviously the quantity 
$\sum_{i=1}^{m} V'_{1i} I_{1i}$ can be maximized if we choose $I'_{1i}$ to be the phase conjugate 
of $V'_{1i}$. It then follows that 

$$\sum_{i=1}^{m} V'_{1i} I'_{1i} = \sum_{i=1}^{m} |V'_{1i}| = \lambda' \geq \lambda. \tag{8}$$

Let $V'_{2i}$ be the voltages produced by $I'_{1i}$. We obtain, by applying equa- 
tion (6), the following, 

$$\sum_{i=1}^{n} V'_{2i} I'_{2i} = \sum_{i=1}^{m} |V'_{1i}| = \lambda' \geq \lambda. \tag{9}$$
Now $I''_{2i}$ can again be chosen to be phase conjugate to $V'_{2i}$ and we obtain 
$$\sum_{i=1}^{n} V'_{2i} I''_{2i} = \sum_{i=1}^{n} |V'_{2i}| = \lambda'' \geq \lambda'. \tag{10}$$

This process continues with each new choice of I representing the actual 
retransmission adjustment made by the antenna system. It is obvious 
from equation (10) that each retransmission yields a new value of $\lambda$ 
which is real and bigger than or equal to the previous value. However, 
because of the finite number of antennas involved, $\lambda$ cannot increase 
indefinitely. The iteration process must therefore finally settle down to 
a value $\lambda_f$ which no longer changes. If this is so, we have 

$$\sum_{i=1}^{m} V^f_{1i} I^f_{1i} = \sum_{i=1}^{n} V^f_{2i} I^f_{2i} = \lambda_f . \tag{11}$$

The fact that $\lambda_f$ is real, and also that we cannot vary the phase of 
$I^f_{1i}$ and $I^f_{2i}$ to make $\lambda_f$ larger automatically guarantees that $I^f_{1i}$ and $I^f_{2i}$ 
are phase conjugate to $V^f_{1i}$ and $V^f_{2i}$, respectively. In this case, our phase 
conjugate retransmission apparatus will no longer change the phases of 
$I^f_{1i}$ and $I^f_{2i}$ because they have already reached their proper value. 
Therefore, we have arrived at a stable state. In this case equation (11) 
can be further simplified to 

$$\sum_{i=1}^{m} |V^f_{1i}| = \sum_{i=1}^{n} |V^f_{2i}| = \lambda_f . \tag{12}$$
So far we have demonstrated that each retransmission tends to in- 
crease $\lambda$ and a stable state must finally be reached. It still remains 
to be shown that this stable state yields the absolute maximum $\lambda$. 
It is quite possible that several pairs of $I_1$ and $I_2$ exist such that they 
are phase conjugate to $V_1$ and $V_2$ but their corresponding $\lambda_f$'s are 
different. This is similar to the existence of different eigenstates in 
matrix analysis. As is well known in matrix algebra, unless the 
initial vector is orthogonal to the maximum eigenstate, we would in- 
variably obtain the maximum eigenstate through iterations. 

Since the phase conjugate operation on V to produce I is a non- 
linear operation, an analytical analysis along the above lines is ex- 
tremely difficult, if not impossible. However, in the next section we 
show with computer simulation that the phase conjugate retransmis- 
sion process converges rapidly and the probability of ending up in 
a nonmaximum state of $\lambda_f$ is practically zero. 


2.2 Computer Simulation 


The convergence test was done by choosing a 3:4 array system as a 
particular trial case. We started by arbitrarily choosing a $\Gamma$ matrix, 
which was defined by $\Gamma_{IJ}$ = I/1.2 + J/2 − 1 + j[J/2.8 + 2 − J/1.2]. 
The initial values of $I_1$ were chosen such that 

$$I_1 = [1, \exp(j\theta), \exp(j\varphi)]. \tag{13}$$

The phase angles $\theta$ and $\varphi$ were allowed to run through 0 to $2\pi$, in 10 
equal steps. Therefore, we had 100 different initial trial values of $I_1$. 
For each initial set of $I_1$, we calculated the $V_2$ produced and formed the $I_2$ which 
produced $V_1$. $I_1$ was then readjusted according to the $V_1$ just produced. 
In each retransmission, we also computed the quantity $\lambda$. It was ob- 
served that in all these one hundred trials, the currents and $\lambda$ approached 
their specific final values within a few retransmissions. For this particular 
choice of $\Gamma$, $\lambda_f$ = 31.8719. The first value of $\lambda$ obtained, that is, 
$\sum_{i=1}^{n} |V_{2i}|$, was always smaller than $\lambda_f$ but after the first retrans- 
mission, it invariably came very close to $\lambda_f$. For example, in one case 
the first $\lambda$ was 10.72; after retransmission at array 2 we obtained a $\lambda$ of 
30.73 at array 1. After this array retransmitted back to array 2, the 
value agreed with $\lambda_f$ to the fourth decimal place. 
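
The retransmission loop described above can be sketched in a few lines of Python. This is a hedged illustration only: the matrix is an arbitrary random 3 x 4 example rather than the one used in the paper, C is taken as 1, and the names `phase_conjugate` and `retransmission_history` are assumptions introduced for the example.

```python
import cmath
import random


def phase_conjugate(voltages):
    """Unit-amplitude currents carrying the conjugate phase of each voltage."""
    return [cmath.exp(-1j * cmath.phase(v)) for v in voltages]


def retransmission_history(gamma, i1, rounds=20):
    """Successive values of lambda = sum |V| at array 2, array 1, array 2, ..."""
    m, n = len(gamma), len(gamma[0])
    history = []
    for _ in range(rounds):
        # Array 1 transmits; array 2 receives V2 = Gamma^t I1.
        v2 = [sum(gamma[i][j] * i1[i] for i in range(m)) for j in range(n)]
        history.append(sum(abs(v) for v in v2))
        i2 = phase_conjugate(v2)
        # Array 2 retransmits; array 1 receives V1 = Gamma I2.
        v1 = [sum(gamma[i][j] * i2[j] for j in range(n)) for i in range(m)]
        history.append(sum(abs(v) for v in v1))
        i1 = phase_conjugate(v1)
    return history


rng = random.Random(3)
gamma = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(4)] for _ in range(3)]
start = [1.0, cmath.exp(0.7j), cmath.exp(2.1j)]  # one trial of the form of equation (13)
history = retransmission_history(gamma, start)
print([round(x, 4) for x in history[:6]])
```

By the argument of Section 2.1 the printed sequence should never decrease. Repeating the run over a grid of starting phases is one way to look for a nonmaximum final state.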

Next we tried to determine if $\lambda_f$ is the absolute maximum. In other 
words, we wanted to check if $\lambda_f$ was bigger than the $\lambda$, that is, $\sum_{i=1}^{n} |V_{2i}|$, 
produced by any arbitrary $I_1$. This survey was done by varying $\theta$ and $\varphi$ 
in 50 steps from 0 to $2\pi$. Computation indicated that all the 2500 values 
of $\lambda$ produced were smaller than $\lambda_f$ and that $\lambda_f$ was indeed the real 
maximum. 
A similar test was performed on a 4 : 5 array system and we obtained 
similar results as reported for the 3 : 4 system. In the 4 : 5 array system, 
the T',, were defined as J — J)/3 + 7J/6—5+ 9[(2 ~-P? + J)/14 + 
3.5]. 


III. SIGNAL-TO-NOISE RATIO 


Let $V_{1i}$ be the voltage response at the ith elementary antenna. 
Furthermore, let $n_{1i}$ be the corresponding noise voltage which satisfies 
$$\langle n_{1i} n_{1j} \rangle_{av} = N^2 \delta_{ij} \tag{14}$$
where the $\langle\ \rangle_{av}$ stand for time average. 


3.1 S/N of Phase Conjugate System Using Equal Gain Diversity Com- 
bining Technique 


The S/N of an m-branch diversity equal gain system is 

$$S/N = \left[\sum_{i=1}^{m} |V_{1i}|\right]^2 \Big/ mN^2 = \lambda_f^2 / mN^2 . \tag{15}$$

Recall that there are n elements at the other array, which radiates a 
total power to the amount of nR; therefore the S/N of the received 
signal per unit power radiated is 

$$S/N = \lambda_f^2 / nmN^2 R . \tag{16}$$
It is therefore obvious that the S/Ns at both arrays are identical. 


3.2 S/N of Complex Conjugate System Using Maximal Ratio Diversity 
Combining Technique 


The excitation currents of a complex conjugate retransmission sys- 
tem are related to the incoming voltages by 

$$I_1 = K_1 V_1^* \tag{17}$$
$$I_2 = K_2 V_2^* \tag{18}$$

where $K_1$ and $K_2$ are scalars to keep the total radiated power constant. 
For unity transmitter power, the received power at arrays 1 and 2 
are maximized and are equal,’ 

$$P_{1r} = P_{2r} = \frac{G}{R} C^2 \lambda_m \tag{19}$$
where $\lambda_m$ is the maximum eigenvalue of the hermitian matrix $\Gamma\Gamma^\dagger$. 
The validity of equation (19) is subject to the constraint that when 
the adaptive retransmission array starts operation, its current vector 
should not be orthogonal to the maximum eigenvector of the $\Gamma\Gamma^\dagger$ 
matrix. The S/N of a multibranch maximal ratio reception system 
then is 
$$S/N = \frac{C^2 \lambda_m}{R N^2} . \tag{20}$$


It can be seen that the S/Ns at both arrays are equal. 


IV. EVALUATION OF THE CUMULATIVE PROBABILITY DISTRIBUTION 


The complexity of the quantities $\lambda_m$ and $\lambda_f$ makes a closed form 
solution of the CPD extremely difficult, if not impossible. Therefore, 
we try instead the Monte Carlo method and aim at a numerical 
solution. The essence of the method is to choose for each element of 
the $\Gamma$ matrix a random variable of the form u + jv. The variables 
u and v, according to our assumption of independent Rayleigh fading 
statistics, are normalized independent gaussian variables. For a par- 
ticular m:n array system, we can therefore evaluate the maximum 
eigenvalue $\lambda_m$ by repeated matrix multiplication.° The value $\lambda_f$ is 
evaluated by iterations according to the retransmission schemes defined 
in Section 2.2. 
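
One common way to obtain a maximum eigenvalue by repeated matrix multiplication is the power method. The sketch below applies it to the hermitian matrix formed from a transmission matrix and its conjugate transpose. It is an illustrative assumption about how such a computation could be organized, not a reproduction of the program used for the paper; the function name `max_eigenvalue`, the iteration count and the starting vector are all hypothetical choices.

```python
import random


def max_eigenvalue(gamma, iterations=200):
    """Largest eigenvalue of H = Gamma * Gamma^dagger by the power method."""
    m, n = len(gamma), len(gamma[0])
    H = [[sum(gamma[r][k] * gamma[c][k].conjugate() for k in range(n))
          for c in range(m)] for r in range(m)]
    # Starting vector; assumed not orthogonal to the dominant eigenvector.
    x = [complex(1.0, 0.0)] * m
    for _ in range(iterations):
        y = [sum(H[r][c] * x[c] for c in range(m)) for r in range(m)]
        norm = sum(abs(v) ** 2 for v in y) ** 0.5
        x = [v / norm for v in y]
    # Rayleigh quotient x^dagger H x with x of unit length.
    y = [sum(H[r][c] * x[c] for c in range(m)) for r in range(m)]
    return sum(xc.conjugate() * yc for xc, yc in zip(x, y)).real


rng = random.Random(11)
# One Monte Carlo draw for a 2:4 system: elements u + jv with normalized gaussians.
gamma = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(4)] for _ in range(2)]
lam_m = max_eigenvalue(gamma)
print(round(lam_m, 4))
```

The starting vector plays the role of the initial current vector mentioned after equation (19): the iteration finds the maximum eigenvalue only if that vector has a component along the dominant eigenvector.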

The computed values of $\lambda_m$ and $\lambda_f$ are stored. Then we start the 
whole process again by choosing elements for another $\Gamma$ matrix and 
evaluate the corresponding $\lambda_m$ and $\lambda_f$. The CPD curves are developed 
after a sufficient number of calculations. 

Two tests of convergence are made. The first is the comparison of 
the calculated CPD curves of variables $|u_1 + jv_1|$ or $|u_1 + jv_1|^2 + 
|u_2 + jv_2|^2$ to that of the known theoretical curves. It is understood 
here that u's and v's refer to independent normalized gaussian random 
variables. Hence, these curves represent respectively the CPD of 
maximal reception of single or two-channel Rayleigh signals.* 

The results are presented in Fig. 2. A close look at Fig. 2 indicates 
that as far as the 99 percent reliability and the average signal levels 
are concerned, 900 sample points are sufficient for a single Rayleigh 
and 300 sample points for two Rayleighs. 
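
A convergence test of this kind can be sketched as follows for the single Rayleigh channel. The example works with the power $|u + jv|^2$, which for unit-variance gaussian u and v is exponentially distributed with mean 2, and compares an empirical CPD with that known curve. The sample count, the seed and the function names are assumptions made for the illustration.

```python
import math
import random


def rayleigh_power_samples(count, rng):
    """Samples of |u + jv|^2 with u, v independent unit-variance gaussians."""
    return [rng.gauss(0, 1) ** 2 + rng.gauss(0, 1) ** 2 for _ in range(count)]


def empirical_cpd(samples, level):
    """Fraction of samples at or below the given power level."""
    return sum(1 for s in samples if s <= level) / len(samples)


def theoretical_cpd(level):
    """Exponential distribution with mean 2 (sum of two unit-variance squares)."""
    return 1.0 - math.exp(-level / 2.0)


rng = random.Random(1970)
samples = rayleigh_power_samples(20000, rng)
for level in (0.5, 2.0, 5.0):
    print(level, round(empirical_cpd(samples, level), 3), round(theoretical_cpd(level), 3))
```

Reducing the sample count in such a sketch shows how quickly the empirical curve degrades in the low-probability tail, which is the region that fixes the 99 percent reliability level.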

A second test is made on the 2:2 and 2:4 antenna system and is 
shown in Fig. 3. The dB scale is chosen such that the average S/N 
of a single Rayleigh variable, that is, the received S/N of a 1:1 array 
system, is at 0 dB. It is observed that 96 samples are already sufficient 
Fig. 2—Comparison of Monte Carlo method and theoretical calculation. △, 
300 samples; ×, 900 samples; ○, 1800 samples; solid line, theoretical curve. 


to yield what we want since these points lie very close to the curve 
drawn through the points computed from 900 samples. With the re- 
quired sample points greatly reduced to this number, it is possible 
to make a fast and inexpensive check of an extensive combination 
of m:n arrays. 
Fig. 3—Comparison of 96 and 960 samples. Complex conjugate retransmission 
maximal ratio reception. ○, 960 points; ×, 96 points; solid line, curve fitted to 
960 points. 
V. DISCUSSION OF NUMERICAL RESULTS 


We look at the complex conjugate retransmission system first. 
Incorporated with maximal ratio diversity reception, this system 
provides the best S/N performance obtainable from a particular 
m:n array system. 

The average S/N is presented in Fig. 4. It is seen that for small 
numbers of n, there do exist appreciable improvements in average 
signal level as m changes from 1 to 4. However, as n increases the 
advantage diminishes. For example, a 1:50 array has the same average 
signal level as 2:44, 3:39, and 4:35 arrays. This is in sharp contrast 
to the case of adaptive arrays with nonfading signals. In that case, 
plane wave incidence is assumed and an m:n array would have the 
same S/N as a 1:mn array (Fig. 4). 

A simple explanation of the difference between the fading and the 
nonfading arrays is the following: In both cases, the 1:mn adaptive 
retransmission system guarantees that the voltages produced by the 
mn elements at the single array add in phase. In the m:n system, the 
Fig. 4—Average S/N of complex conjugate retransmission arrays. 
Fig. 5—99 percent reliability level. Complex conjugate retransmission maximal 
ratio diversity reception. 


voltage components produced by the n antennas again add in phase 
at each antenna of the “m” array if plane wave incidence is assumed. 
Consequently, the power received is identical to that of the 1:mn 
array. However, in a random environment the n voltage components 
at each antenna element in the m array no longer add in phase; there- 
fore, the m:n system receives less power than that of the 1:mn system. 

With reference to Fig. 2, we notice that for 99 percent of the time, 
the single Rayleigh signal has a value above —20.6 dB; we will 
designate —20.6 dB as the 99 percent reliability level. Hence the dif- 
ference in dB values of two antenna systems for a particular re- 
liability indicates their difference in signal threshold or their difference 
in the required transmitter power. The 99 percent reliability level 
is presented in Fig. 5. We next define fading range as the dB difference 
between the average S/N and the 99 percent reliability level. There- 
fore, fading range should provide a good indication of the smooth- 
ness of the received signal. The fading range is presented in Fig. 6. 
It is seen that as n increases, the 99 percent reliability level ap- 
proaches the average signal level. In other words this means that as 
Fig. 6—Fading range of an m:n array system. Complex conjugate retransmis- 
sion maximal ratio diversity reception. 


the number of diversity branches increases, the fading range starts 
to diminish. Figure 7 presents the CPD of a 4:32 array system. We 
note that the CPD curve is extremely flat and the signal level varies 
within a +1 dB range, indicating a greatly reduced fading range as 
compared to either Figs. 2 or 3. 
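
The shrinking of the fading range with the number of diversity branches can be illustrated with a small Monte Carlo sketch for 1:n systems. For m = 1 the hermitian matrix of Section 3.2 reduces to the scalar sum of the squared magnitudes of the n matrix elements, so no eigenvalue routine is needed. The sample count, the seed and the function names are assumptions for the example, and the estimates are subject to sampling error.

```python
import math
import random


def fading_range_db(samples, reliability=0.99):
    """dB difference between the mean and the level exceeded `reliability` of the time."""
    ordered = sorted(samples)
    index = int((1.0 - reliability) * len(ordered))
    mean = sum(ordered) / len(ordered)
    return 10.0 * math.log10(mean / ordered[index])


def one_to_n_power(n, rng):
    """Maximum eigenvalue for a 1:n system: the sum of n independent |u + jv|^2 terms."""
    return sum(rng.gauss(0, 1) ** 2 + rng.gauss(0, 1) ** 2 for _ in range(n))


rng = random.Random(7)
ranges = {n: fading_range_db([one_to_n_power(n, rng) for _ in range(5000)])
          for n in (1, 4, 16)}
for n, value in ranges.items():
    print(n, round(value, 1))
```

The per-unit-power normalization is omitted because a constant factor shifts the average and the reliability level by the same number of dB and leaves their difference unchanged.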

We discuss now results obtained from the phase conjugate retrans- 
mission system. In this system, as was discussed in Section II, the 
S/N of an equal gain diversity reception system is maximized. It is 
observed that because of this maximization effect, the performance 
of the phase conjugate system is not much inferior to that of the 
complex conjugate system. For example, the CPDs of the S/N for 
both systems in the case of a 2:4 array system are presented in 
Fig. 8. The CPD curves of the two systems differ approximately by 
the average S/N difference. Therefore, the difference in average S/N 
Fig. 7—CPD of a 4:32 array system. Complex conjugate retransmission maximal 
ratio diversity reception. 


of the two systems is also a good indication of their difference in 
percentile reliability levels. 

The average S/N of the two systems is shown in Fig. 9 for 2:n 
and 4:n array systems. It is seen that for the same m:n array, the 
difference of the two systems is small, that is, within a dB or so. 


VI. CONCLUSIONS 


We observed that in a fading environment, both complex conjugate 
retransmission and phase conjugate retransmission systems are capable 
of reaching a stable state and yield optimum results by greatly in- 
creasing the S/N at the receiving stations. 

The performance of these two systems differs little. Therefore the 
Fig. 8—CPD curves of a 2:4 array system. ○, complex conjugate retransmis- 
sion with maximal ratio reception; ×, phase conjugate retransmission with equal 
gain reception. 
Fig. 9—Average S/N of antenna systems 2:n and 4:n. —______. __ ; 
complex conjugate maximal ratio reception; —____, phase conjugate retrans- 
mission equal gain reception. 





choice of a particular scheme should be based on practical con- 
siderations. For example, in the phase conjugate system, the total 
power is divided equally among all the antenna elements. On the other 
hand, the complex-conjugate retransmission system requires that the 
total power be distributed in a complicated fashion. In practice this 
means that each antenna-feeding apparatus must be equipped to 
handle power far exceeding that of the phase conjugate system. 

In view of the simplicity of the phase conjugate retransmission com- 
pared to the complex conjugate retransmission (which must keep the 
total power transmitted constant), and only slightly inferior per- 
formance, the former appears to be a more attractive system. 

As far as the division of diversity branches is concerned, it can 
be seen from Fig. 4 that for small numbers of antennas, 
an m:n array would have similar performance to an mn:1 array. 
However, as the number of elements involved becomes larger, this 
relation no longer holds. For example the performance of a 4:n array 
would approach a 1:n array as n increases indefinitely. 
Microwave Line-of-Sight Propagation With 
and Without Frequency Diversity 


By W. T. BARNETT 
(Manuscript received May 5, 1970) 


Amplitude measurements were made for 68 days in 1966 for seven 
4-GHz and six 6-GHz signals on a typical radio relay path. Identical mea- 
surements were also made for one 4-GHz signal on a second path having a 
common reception point with the first path. We present the results from an 
analysis centered on the fade-depth distribution for fades exceeding 20 dB. 
The more significant results are: 


(i) The fade-depth distribution for all single (nondiversity) channels 
in a 5-10 percent band on the same path is essentially the same. Further, 
the distribution has the Rayleigh slope. 

(ii) The single-channel fade-depth distributions differ for 4 and 6 GHz 
on the same path; the distributions also differ for the same 4-GHz frequency 
on adjacent paths with a common reception point. 

(iii) One-for-one frequency diversity can be characterized during multi- 
path fading periods for either the 4- or 6-GHz bands by the ratio of two 
quantities. The first is the percent frequency separation between diversity 
components. The second is the nondiversity fade-depth distribution. 


I. INTRODUCTION 


Line-of-sight microwave systems are affected by multipath propaga- 
tion. When this phenomenon is present, the output from a receiving an- 
tenna can be practically zero for seconds at a time. Experimental data 
are difficult to obtain because long time periods of continuous coverage 
are needed to observe sufficient fading activity at the fade depths (30-
40 dB) of interest for high performance systems. The literature is ex- 
tensive on this general topic’~’ but limited and in some cases contradic- 
tory’ for these fade depths. The results available regarding frequency 
diversity are even more limited’. For these and other reasons, an extensive 
experimental program was undertaken in 1966. 

Continuous amplitude measurements were made for 68 days at a rate 
of 5 samples per second per channel for seven 4-GHz and six 6-GHz 
signals on a radio relay path at West Unity, Ohio. Identical measure- 
ments were also made for one 4-GHz signal on a second path having a 
common reception point with the first path. Here a 68-day summer 
period (July 22 to September 28) in 1966 has been subjected to detailed 
analysis. 

We present the results of the data analysis and their interpretation 
along with pertinent background information. Briefly the order of
presentation is (i) experiment description, (ii) determination of the
reference values used for calibration, (iii) nondiversity results, (iv)
frequency diversity results, (v) a mathematical description of pairwise
fading which is used to interpret the improvement obtained from
frequency diversity, (vi) 4/6 GHz crossband results, (vii) adjacent hop
results, and (viii) a comparison of space and frequency diversity.


II. SUMMARY 


New results have been obtained from the data concerning 4- and 6-GHz 
propagation on line-of-sight paths. The present analysis was centered 
on the fade-depth distribution for fades of 20 dB or more. A simplified 
listing of the significant findings follows. 


(i) During nonfading conditions, the received microwave signal
power was constant for the entire test period to within ±1 dB
including equipment variations.

(ii) The fade-depth distributions for all single (nondiversity)
channels in a 5-10 percent band are essentially the same and have a
Rayleigh slope.

(iii) The single-channel fade-depth distributions differ for 4 and 6
GHz on the same path; the distributions also differ for the same
4-GHz frequency on adjacent paths with a common reception
point.

(iv) The performance of a one-for-one frequency diversity system
can be specified for either the 4- or 6-GHz bands by the ratio
of two quantities. The first is the percent frequency separation
(100 Δf/f) between in-band diversity signal components. The
second is the experimental nondiversity fade-depth distribution
P(L). In these terms the improvement (I) of a diversity system
relative to the nondiversity system as obtained from the data is
simply

$$ I = 0.18 \, \frac{\Delta f / f}{P(L)}. $$

This model is based upon the in-band frequency diversity data
and is in agreement therewith.

The factor I characterizes frequency diversity during multipath
fading periods. As such, it should be applicable to different
climates and terrains for path lengths of approximately 28
miles.

(v) The improvement from 4/6 GHz crossband diversity was not 
significantly better than in-band diversity of 2 percent or more 
separation. 

(vi) Adjacent section diversity with a common point (as based on 
data on a single channel) was not significantly better than in- 
band frequency diversity. This raises some provoking (unan- 
swered) questions about the correlation of selective fading on 
adjacent routes, for example, limitations on the maximum pos- 
sible diversity improvement to values less than those expected 
from independent fading. 

(vii) The performance of space diversity’® is comparable to that of 
one-for-one frequency diversity on the same hop. 

(viii) The polarization of the radio signals had no noticeable effect on 
the amount of fading. 


These results are presented in detail along with the necessary back- 
ground information in the following sections. 


III. EXPERIMENT DESCRIPTION 


The transmitted power in microwave radio systems is constant. Prop- 
agation data can therefore be obtained from in-service systems without 
interfering with their operation by using suitable monitoring equipment. 
Such equipment (MIDAS*) was installed at West Unity, Ohio, to monitor 
and record the received envelope voltages of standard TD-2 (4 GHz) 
and TH (6 GHz) signals. A list of the channels is given on Table I. 
Briefly there were seven 4-GHz, six 6-GHz, and two space-diversity 
channels on one hop and one 4-GHz channel on a second hop. A functional 
block diagram is shown on Fig. 1. 

West Unity, Ohio, was chosen as the measuring site for this experiment 
because it lies along a major route in an area with a reputation for con- 
siderable fading. Further, the hops measured have average lengths (28.5 
and 29.4 miles) with negligible ground reflections. The two paths differ 
in azimuth by 68 degrees and their profiles are given on Figs. 2 and 3; 
clearance is adequate even for the extreme case of equivalent earth radius 
(k) equal to two-thirds. 


* An acronym for Multiple Input Data Acquisition System. 
TABLE I—RADIO CHANNELS MEASURED AT WEST UNITY, OHIO

From Pleasant Lake, Indiana (28.5 mi)

Channel No.   Frequency (MHz)   Antenna          Polarization
4-7           3750              Horn Reflector   V
4-1           3770              Horn Reflector   H
4-8           3830              Horn Reflector   V
4-2           3850              Horn Reflector   H
4-9           3910              Horn Reflector   V
4-11          4070              Horn Reflector   V
4-6           4170              Horn Reflector   H
6-11          5945.2            Horn Reflector   H
6-13          6004.5            Horn Reflector   H
6-14          6034.2            Horn Reflector   V
6-15          6063.8            Horn Reflector   [illegible]
6-17          6123.1            Horn Reflector   H
6-18          6152.8            Horn Reflector   V
6-UD          6152.8            Upper Dish       V
6-LD          6152.8            Lower Dish       V

From Paulding, Ohio (29.4 mi)

Channel No.   Frequency (MHz)   Antenna          Polarization
4-6           4170              Horn Reflector   V

Note: The 4-X channels correspond to standard TD-2 radio system signals;
6-X corresponds to TH.


The MIDAS equipment derived received signal strength information 
by sampling the voltage of the 70-MHz IF signal at a point where it was 
linearly related to the RF signal. At any instant the particular channel 
being measured was selected automatically by MIDAS. A common 
detector then converted the IF amplitude measurement to a dc voltage 
which was quantized into one of 32 contiguous steps over a 45-dB range. 
The MIDAS input-output curve is given as Fig. 4. 

The data were recorded on paper tape along with the necessary timing 
information. Measurements were made throughout the 68-day period 
at a rate of 5 samples per second on each channel. The information was 
recorded for all channels at rates of either 1 sample per 30 seconds, 1 
sample per 2 seconds, or 5 samples per second (normal, intermediate, 
and fast rates) depending on the fading activity of the channels under 
test. The recording rate was automatically selected by MIDAS so as 
to record all significant fading. During computer processing of the data, 
the amplitude value at a sampling instant was assumed to hold until the 
next sampling instant. 
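
The hold assumption described above can be illustrated with a short sketch. It is not the original processing program; the function name `time_faded` and the sample record are hypothetical, and fade depths are taken as positive dB values below the reference.

```python
def time_faded(samples, end_time, fade_db):
    """Total time (seconds) during which the held signal is faded by at
    least fade_db.

    samples is a list of (time_s, fade_depth_db) pairs in increasing time
    order. Each value is assumed to hold until the next sampling instant;
    the last value holds until end_time.
    """
    total = 0.0
    following = samples[1:] + [(end_time, None)]
    for (t, depth), (t_next, _) in zip(samples, following):
        if depth >= fade_db:
            total += t_next - t
    return total


# Hypothetical record that switches from the normal rate (30 s) to the
# fast rate (0.2 s) when a fade begins.
record = [(0.0, 5.0), (30.0, 22.0), (30.2, 31.0), (30.4, 24.0), (30.6, 8.0)]
faded_20 = time_faded(record, end_time=60.0, fade_db=20.0)
faded_30 = time_faded(record, end_time=60.0, fade_db=30.0)
```

Because each sample is weighted by the interval it is held for, records taken at the three recording rates can be accumulated into a single fade-depth distribution.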
Fig. 1—Experimental layout, Pleasant Lake to West Unity (Paulding to West
Unity not shown).

Fig. 2—West Unity—Pleasant Lake path profile.

Fig. 3—West Unity—Paulding, Ohio, path profile.

Fig. 4—MIDAS calibration curve.

An important feature of the experiment was long-term continuous 
coverage. Deep fades are rare events occurring at unpredictable times; 
the test equipment had to be on-line continuously to obtain an adequate 
sample. 

The required test equipment reliability and measurement uniformity 
were obtained by maximum use of common equipment. The essentially 
continuous coverage was obtained by recording mainly the significant 
fading data. Even so the subsequent processing was a formidable task, 
even with the computer, because of the high volume of raw data. 


IV. NONFADING SIGNAL VALUES 


A fade is defined as a decrease in the envelope of the received signal 
voltage with respect to a reference or free-space value. Thus before fading 
data can be quantified, the reference or nonfading value must be deter- 
mined. 

If the atmosphere between the transmitting and receiving antennas 
were homogeneous (that is, no vertical or horizontal variations in the 
index of refraction), then the single frequency RF power at the output 
of the receiving antenna would be invariant for a fixed transmitted 
power.* Its value (called the free-space value) could be calculated in a 
straightforward manner. However, even during nonfading periods, there 
are small time-varying random deviations in the refractive index which 
cause small scintillations in the received power even when the average 
value remains constant. There are also long-term variations in the re- 
ceived RF power due to equipment variation. For our purposes we must 
determine the nonfaded received power as a function of time and, if 
possible, quantify the scintillations. 

Inspection of the data showed that the midday hours had the least 
amount of fading. Here the differentiation between fading and free-space 
scintillations is made on the basis of the magnitude of the effect. Fading 
causes variations of one or more quantizing levels in the envelope from 
hour-to-hour on most of the 15 channels. 

To establish a reference value, midday periods were sought which had 
no fading with respect to either time or frequency. It was easy to find a 
total of 129 midday hours simultaneously for all channels on 30 different 
days scattered throughout the entire 68-day period. 

Table II gives the summaries for the midday values. The table shows
the average signal in terms of quantizing level for five consecutive time
periods of from 9 to 20 days duration. Several points can be made about
these data. First the maximum peak-to-peak variation on any channel is
one level or about 2 dB while the average variation is ±1/4 of a level or
about ±0.5 dB. Further some of the channels, for example, 4-1 and
6-13, exhibit a definite trend over the 129 hours. The belief is that these
long-term effects are attributable to the radio equipment.

In any case, the average deviation of ±0.5 dB is small enough so that
a single reference value for each channel can be used for the entire
time period. This simplifies data reduction considerably.

* Assuming adequate ground clearance and no ground reflections.

TABLE II—AVERAGE NONFADED VALUES
(in terms of quantizing levels)

                         Consecutive time period
                         1      2      3      4      5
Total Hours in Period    254    375    482    264    263
Hours Used               31     22     24     25     27
Total Days in Period     9      16     20     11     12
Days Used                7      7      7      5      4

Note: Quantizing level 4.X means that the average value was 0.X of a level
offset from the center of level 4 in the direction of level 5.

Now consider the statistics of small scintillations in the received signal
power. Table III gives the percent distributions by level for all the
channels for the 129 midday hours. Of course this distribution includes the
long-term equipment variations in the reference values as well as the
short-term scintillations. Note that the channels with the minimum
variations in average value from Table II are those with most of their
"90 percent hours" in a single level. These are 4-9, 4-11, 6-14, and 6-17.
It is assumed that the variations on these channels are due only to
scintillation and that this effect can be represented by a probability
distribution which is normal in dB. The σ of this distribution can be found from the
percent values given in Table III with the results shown in Table IV.
The agreement between these channels is excellent. The conclusion is
that the scintillation effect over a 68-day period is universal with a σ of
0.6 dB superimposed on an equipment variation of ±0.5 dB. The rms
variation in reference value is then 0.8 dB.
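
The arithmetic behind the 0.8-dB figure, and a rough version of the σ estimate, can be sketched as follows. The root-sum-square combination follows from the numbers quoted above. The σ estimate rests on two assumptions that are not stated in the text: a level width of about 2 dB (the text's "one level or about 2 dB") and a distribution centered on the level. It is therefore only indicative.

```python
import math
from statistics import NormalDist


def rms_combination(*components_db):
    """Root-sum-square of independent variations expressed in dB."""
    return math.sqrt(sum(c * c for c in components_db))


def sigma_from_level_fraction(fraction_in_level, level_width_db=2.0):
    """Sigma (dB) of a normal-in-dB distribution, given the fraction of
    time spent inside one quantizing level.

    Assumes the distribution is centered on the level (an assumption made
    here for illustration only).
    """
    z = NormalDist().inv_cdf(0.5 + fraction_in_level / 2.0)
    return (level_width_db / 2.0) / z


# 0.6 dB scintillation combined with 0.5 dB equipment variation.
total_rms = rms_combination(0.6, 0.5)

# Channel 4-9 spends 91.94 percent of the midday time in a single level.
sigma_4_9 = sigma_from_level_fraction(0.9194)
```

Under these assumptions the estimate for channel 4-9 falls close to the tabulated value; an off-center distribution would change the result somewhat.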


4.1 Channel Calibration 


The data on reference values were combined with the MIDAS calibration
curve to calibrate the 15 RF channels in dB. First, all the 6- and 4- 


TABLE III—SUMMARY OVER ENTIRE 68 DAYS 
(Data for 129 Hours on 30 Days) 


Hours with 90 Percent 














Percent of Time in Level or More of Time in Level 
Channel 

Freq. 1 2 3 4 5 1 2 3 4 5 
West 
Unity 

4- 7 0.32 | 78.25 | 21.438 63 

-1 18.1 80.50 1.40 12 93 

- 8 17.52 | 81.34 1.138 0.01 9 88 

- 2 0.01 | 86.25 | 13.74 95 

- 9 7.41 | 91.94 0.65 1] 111 

~-11 10.36 | 88.93 0.71 96 

6 0.77 | 32.50 | 66.73 1 | 16] 58 
6-11 28.57 | 69.60 1.83 9 47 

-13 34.56 | 63.68 1.76 31 71 

-14 6.01 | 92.65 1.34 4 118 

~15 83.95 | 16.01 0.04 94 6 

-17 2.10 | 94.96 2.94 113 

-18 17.88 | 78.96 | 3.16 4} 69 | 1 

6-UD 34.65 | 65.27 0.08 18 42 8 
6-LD 33.2 66.80 24 | 66 

Paulding 
4- 6 4.09 | 58.04 | 37.87 2, 59 A2 
TABLE IV—LONG-TERM STANDARD DEVIATION

Channel    σ in dB
4-9        0.56
4-11       0.59
6-14       0.58
6-17       0.56

GHz channels were simultaneously lined up at their reference level which 
was specified as 0 dB. By inspection, 29 dB values were chosen over the 
fading range in order to give minimum ambiguity over the entire set of 
channels; thus each quantizing step on each channel was not used more 
than once. In this way all the 6- and 4-GHz channels were simul- 
taneously calibrated; this was done so that an arbitrary subset could 
be chosen for analysis without having to recalibrate. Figure 5 gives an 
example of the results of the calibration procedure for channels 4-7, 
4-2, and 4-9 for fades greater than 20 dB. 

Because the calibration curve is nonlinear, this process requires some
judgment. The minor combined effects of the nonlinear calibration
curve and differing reference levels for different channels are discussed
in conjunction with the single channel outage statistics.

Fig. 5—Calibration example.


V. SINGLE CHANNEL RESULTS 


The raw data were obtained continuously for almost all of the 68 days 
($5.9 \times 10^{6}$ seconds). Of this total, $5.26 \times 10^{6}$ seconds was used as the 
data base; the balance was lost mainly because of routine radio main- 
tenance. To condense the data, a criterion was used to select by computer 
only those time periods which exhibited fading. The start of such a time 
period was defined by, and included, ten consecutive measurements 
containing any one channel faded below approximately 10 dB. The 
end of the time period was defined as that instant for which the next 
110 consecutive measurements on any channel did not have a fade 
exceeding approximately 10 dB. 

From the total of $5.26 \times 10^{6}$ seconds, $7.8 \times 10^{5}$ seconds (14.8 percent)
were selected for analysis. The average length of the periods selected
was sizeable. There were only 96 distinct periods selected; these had
an average length of $8.1 \times 10^{3}$ seconds (2¼ hours). Further one-half
of the analysis time was in intervals of four hours or longer. Thus
any effects due to beginning or ending a time period should be minimal.
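
The durations quoted in this section can be checked with a few lines of arithmetic. The variable names are illustrative; the numbers are those given in the text.

```python
SECONDS_PER_DAY = 86_400

total_s = 68 * SECONDS_PER_DAY          # full 68-day test period
data_base_s = 5.26e6                    # seconds used as the data base
selected_s = 7.8e5                      # seconds selected as fading periods
n_periods = 96                          # distinct periods selected

selected_fraction = selected_s / data_base_s
mean_period_s = selected_s / n_periods
mean_period_h = mean_period_s / 3600.0
```

The 68 days correspond to about 5.9 million seconds, the selected time is about 14.8 percent of the data base, and the mean period length is a little over two and a quarter hours.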

The data were processed by computer to determine the total amount 
of time during which each signal was less than a certain amount. The 
4-GHz single-channel fading results are given on Fig. 6 for fades greater 
than 20 dB. These results and all those to follow are given as a fraction 
of $5.26 \times 10^{6}$ seconds. It is apparent that these statistics are essentially
the same for all the 4-GHz channels and have the Rayleigh slope,
that is, 10 dB per decade of probability over the entire range of data
points. The solid line on the figure is a least-squares fit of a Rayleigh
slope line to the data points, most of which are within ±1 dB as shown 
by the dashed lines. This scatter is due to both the uncertainties in 
the reference value and to the nonlinear calibration. 
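
A least-squares fit with the slope fixed at the Rayleigh value reduces to averaging, as the following sketch shows. It assumes the fit is made in the logarithm of the fraction of time, with the line written as P = c·10^(−F/10); the function name and the data points are hypothetical and are not taken from Fig. 6.

```python
import math


def fit_rayleigh_slope(points):
    """Fit P = c * 10**(-F/10) to (fade_db, fraction_of_time) points.

    With the slope fixed at 10 dB per decade, minimizing the squared error
    in log10(P) gives log10(c) as the mean of log10(P) + F/10.
    """
    intercepts = [math.log10(p) + f / 10.0 for f, p in points]
    return 10.0 ** (sum(intercepts) / len(intercepts))


# Hypothetical points scattered about a line with c = 0.25.
points = [(20, 2.6e-3), (25, 7.7e-4), (30, 2.5e-4), (35, 8.1e-5), (40, 2.4e-5)]
c = fit_rayleigh_slope(points)

# Exact points on the line recover the coefficient.
exact = [(f, 0.25 * 10.0 ** (-f / 10.0)) for f in (20, 30, 40)]
c_exact = fit_rayleigh_slope(exact)
```

Only the intercept is adjustable in such a fit, so the scatter of the points about the line appears directly as a dB offset.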

The 4-2 points outside the 2-dB corridor from 22 to 29 dB are due 
to the nonlinear quantized calibration. From Fig. 5 note that for 4-2 
the dB values used lie near the bottom of the quantizing levels up to 
31 dB at which point they change to the middle of the quantizing levels. 
This gives the effect noted on Fig. 6, that is, the data points are shifted 
to higher fade values for a constant probability. Other anomalies of 
this type in the single-channel results are explainable in this manner. 
For these results and for all others described here, the polarization of 
the signal(s) had no apparent effect. 

Fig. 6—Fade-depth distribution; 4-GHz channels.

The 6-GHz single-channel results are given on Fig. 7 for fades greater
than 20 dB. Again all 6-GHz channels have essentially the same statistics
with the solid line being the least-squares fit with a Rayleigh slope.
Almost all data points are within ±1 dB of the average below 40 dB
except for 6-15 from 37 to 40 dB. This discrepancy is attributable to
nonlinear quantizing as discussed for 4 GHz. The increased scatter
above 40 dB is thought to be due to decreasing measurement sensitivity.

The single-channel results for the space diversity grouping’’ (the 
6-18 signal is received on the horn reflector and two dishes) and for 
the 4-GHz channel on the Paulding route are given on Fig. 8. The lines 
are the least-squares fit with a Rayleigh slope. 

Figure 9 gives a summary of the single-channel statistics and for 
comparison, the true Rayleigh curve. The equations of the least-square 
lines are 


$$
\begin{aligned}
\text{West Unity, 4 GHz:} \quad & P = 0.25 \times 10^{-F/10} \\
\text{West Unity, 6 GHz:} \quad & P = 0.53 \times 10^{-F/10} \\
\text{West Unity, SD:} \quad & P = 0.43 \times 10^{-F/10} \\
\text{Paulding, 4 GHz:} \quad & P = 0.77 \times 10^{-F/10}
\end{aligned}
$$

where F is the fade depth expressed in dB (F ≥ 20 dB). The channel
with the most fading was the 4-GHz Paulding followed by 6 GHz, space
diversity grouping (SD) and 4 GHz. The 4-GHz channels have significantly
less fading than either 6-GHz (by 3.3 dB) or 4-GHz Paulding
(by 4.9 dB). The 0.9-dB difference between signals on antennas of
different height, that is, 6 GHz compared to SD, is thought to be more
apparent than real, although it may be a small height effect.

Fig. 7—Fade-depth distribution; 6-GHz channels.

Fig. 8—Fade-depth distribution; space diversity grouping and Paulding.
(Ordinate: fraction of time that fade depth ≥ abscissa. Abscissa: fade depth
in dB relative to midday normal, 20 to 45 dB.)

Fig. 9—Summary of fade-depth distributions.
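
The dB offsets between the fitted lines, and the fade depth exceeded for a given fraction of time, follow directly from the line coefficients. The sketch below assumes each line has the Rayleigh-slope form P = c·10^(−F/10) with the coefficients quoted above; the function names are illustrative.

```python
import math

# Coefficients c of the least-squares lines, P = c * 10**(-F/10).
FITS = {
    "West Unity 4 GHz": 0.25,
    "West Unity 6 GHz": 0.53,
    "West Unity SD": 0.43,
    "Paulding 4 GHz": 0.77,
}


def fade_depth_db(c, fraction):
    """Fade depth F (dB) exceeded for the given fraction of time."""
    return 10.0 * math.log10(c / fraction)


def offset_db(c_a, c_b):
    """Horizontal offset (dB) of line a relative to line b at equal P."""
    return 10.0 * math.log10(c_a / c_b)


depth_0p1_percent = {name: fade_depth_db(c, 1e-3) for name, c in FITS.items()}
six_vs_four = offset_db(FITS["West Unity 6 GHz"], FITS["West Unity 4 GHz"])
paulding_vs_four = offset_db(FITS["Paulding 4 GHz"], FITS["West Unity 4 GHz"])
six_vs_sd = offset_db(FITS["West Unity 6 GHz"], FITS["West Unity SD"])
```

Because all four lines share the same slope, each offset is the same at every probability level, which is why a single dB figure can describe the difference between two channels.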

It should be noted that having essentially the same fade distribution 
for the 6-18 signal as received on both the dishes and horn reflector 
implies two things. First, the effect of 6-GHz multimoding in the horn 
reflector, circular waveguide, and combining networks must be negligible 
because the dishes use dominant mode elliptical waveguide and no 
combining networks. Secondly the effect of decreased clearance at 
midpath for the lowest dish is less than 0.9 dB. 

One way of explaining the significant differences shown on Fig. 9 is
to examine them in terms of the terrain and the radio path lengths.
Pearson⁷ has given data taken in Britain on the relation between
worst-month fading and the terrain as characterized by the path roughness.*

Assuming that the 68-day period is equivalent to the British
worst-month data, Table V can be compiled from Fig. 9 and Ref. 7.

* Path roughness is the standard deviation of terrain height measurements at
one-mile intervals on a line between transmitter and receiver with the end points
of the path excluded.

The 6-GHz British point has been obtained by assuming that the 
path length is 50 percent longer at 6 GHz than it is at 4 GHz; that is, 
the path length is cast in terms of wavelengths. 

There is good agreement between the British and the West Unity data. 
Thus the difference in depth of fade for a given percentage of time is 
apparently directly related to the terrain roughness and to the path 
length in wavelengths. Of course this is not sufficient evidence to justify 
the extensive use of these parameters. It has long been known, at least 
qualitatively, that fading is more severe over smooth terrain or water 
than on rough paths of comparable frequency, length, and atmospheric 
conditions. 


VI. FREQUENCY DIVERSITY RESULTS 


The simultaneous measurements on a number of different frequencies, 
together with computer processing of the differences in signal level with 
frequency, have provided much more quantitative information than 
previously available on the improvements to be expected from the use 
of frequency diversity. The diversity results specify the total amount of 
time during which the stronger of two signals was less than a certain 
amount (this means that both signals simultaneously were less than 
the given amount). 
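
The diversity statistic defined above can be written compactly: the stronger of two signals is below a given level only when both fade depths reach that level, that is, when the smaller of the two fade depths does. The sketch assumes equally spaced samples and uses hypothetical fade-depth sequences.

```python
def simultaneous_fade_fraction(fades_a_db, fades_b_db, fade_db):
    """Fraction of equally spaced samples in which the stronger of two
    signals is faded by at least fade_db (both signals faded together)."""
    both = sum(
        1 for a, b in zip(fades_a_db, fades_b_db) if min(a, b) >= fade_db
    )
    return both / len(fades_a_db)


# Hypothetical fade depths (dB) of two channels at the same instants.
channel_a = [5, 22, 31, 35, 24, 8, 21, 3]
channel_b = [4, 12, 25, 33, 30, 26, 9, 2]

single_a_20 = sum(1 for a in channel_a if a >= 20) / len(channel_a)
pair_20 = simultaneous_fade_fraction(channel_a, channel_b, 20)
pair_30 = simultaneous_fade_fraction(channel_a, channel_b, 30)
```

The pair statistic can never exceed the single-channel statistic at the same fade depth, since every simultaneous fade is also a fade of each channel.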


6.1 6 GHz 


The results for the 6-GHz pairs for fade depths ≥ 20 dB are given 
on Figs. 10 through 16. Fifteen pairs were obtained from the six 6-GHz 
channels and they are grouped according to frequency separations as 
shown in Table VI. 

Four lines are shown on each figure. The uppermost is the nondiversity
line which is the average single-channel fade-depth distribution as
discussed previously. The bottom solid line, which is tagged with a value
of a parameter $m_L$, is a curve fitted to the data. The dashed lines are
relative to the fitted line and denote a ±2-dB corridor which is an estimate
of the uncertainties in the data due to nonlinear calibration and reference
value determination. The fitted curve is obtained by assuming that the
diversity data are jointly Rayleigh distributed with respect to the
nondiversity curve. The parameter $m_L$ is related to the amount of correlation
between the two components of the distribution. This concept will be
discussed in more detail later.

TABLE V—PATH ROUGHNESS EFFECTS

                                        0.1 Percent Fade Depth
Path                   Roughness        British      West Unity
Pleasant Lake 4 GHz    16.0 meters      23.5 dB      24.0 dB
Pleasant Lake 6 GHz    16.0 meters      29.0 dB      27.3 dB
Paulding 4 GHz         8.5 meters       28.0 dB      28.9 dB

Fig. 10—6-GHz frequency diversity; 30-MHz separation.
Fig. 11—6-GHz frequency diversity; 60-MHz separation.

Inspection of these results (Figs. 10 through 16) shows that for a
fixed frequency separation, the scatter of the data points with respect
to the fitted diversity line is small below 30-dB fade depth but increases
somewhat for larger fade depths.* However for fade depths of 40 dB
or less, all the data points lie within the ±2-dB corridors except for
17/18 on Fig. 10. This latter result is an anomaly because all other
combinations which include 6-17 or 6-18 are quite consistent within
their group. In fact, the consistency of the data points for different
pairs having the same frequency separation is remarkable. Also note
the excellent agreement between the data and the fitted line for the
pair with the maximum frequency spacing (210 MHz).

As the frequency separation increases, it is to be expected that the
diversity performance will improve. This is borne out on Figs. 10 through
16 and is described by increasing values of $m_L$ for increasing frequency
separation. The performance of frequency diversity relative to
nondiversity will be discussed in a later section.

* On the figures, $10^{-6}$ = 5.26 seconds which means that there were few samples
at the higher fade depths.

Fig. 12—6-GHz frequency diversity; 90-MHz separation.

Fig. 13—6-GHz frequency diversity; 120-MHz separation.


6.2 4 GHz 


Fig. 14—6-GHz frequency diversity; 150-MHz separation.

The results for the 4-GHz frequency diversity pairs are given on
Figs. 17 through 24. Twenty-one pairs were obtained from the seven
different TD-2 channels and they are grouped according to frequency
separations as shown in Table VII.

The lines on the figures have exactly the same meaning as in the 6-GHz
case discussed in the previous paragraphs.
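
The pair counts for both bands can be reproduced from the channel frequencies of Table I. The sketch below enumerates every pair within a band and groups the pairs by nominal separation. Rounding the 6-GHz separations to the nearest 30 MHz and the 4-GHz separations to the nearest 20 MHz is a choice made here for illustration.

```python
from collections import Counter
from itertools import combinations

# Channel frequencies in MHz, from Table I.
FREQ_6GHZ = {"6-11": 5945.2, "6-13": 6004.5, "6-14": 6034.2,
             "6-15": 6063.8, "6-17": 6123.1, "6-18": 6152.8}
FREQ_4GHZ = {"4-7": 3750, "4-1": 3770, "4-8": 3830, "4-2": 3850,
             "4-9": 3910, "4-11": 4070, "4-6": 4170}


def pairs_by_separation(freqs, step_mhz):
    """Count channel pairs by frequency separation, rounded to the nearest
    multiple of step_mhz."""
    counts = Counter()
    for f1, f2 in combinations(sorted(freqs.values()), 2):
        nominal = round((f2 - f1) / step_mhz) * step_mhz
        counts[nominal] += 1
    return dict(sorted(counts.items()))


pairs_6 = pairs_by_separation(FREQ_6GHZ, 30)
pairs_4 = pairs_by_separation(FREQ_4GHZ, 20)
```

Six channels give fifteen pairs and seven channels give twenty-one, in agreement with the totals stated in the text.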

Inspection of the results shows that the scatter of the points with
respect to the fitted diversity line is small for fade depths less than 30
dB except for 7/1 on Fig. 17 which has been ignored as an anomaly.
For greater fade depths, the scatter increases and the data points tend
to fall off faster than the fitted line except for Fig. 23 which has a distinct
upward bulge. The fast rolloff might result from noise or interference
effects in the radio system. Because the 6-GHz results do not exhibit these
effects and all of the radio channels were treated identically, the MIDAS
system and the data reduction procedures are probably not the source of
this rolloff. Further some of the pairs follow the fitted
line without any rolloff, for example, 4-9/11 on Fig. 21 and 4-7/2 on
Fig. 20. The reasons for the anomalies are not explicitly known but it is
assumed that they are not generated by multipath fading. In any case,
the fitted line is a conservative approximation to the data except for
Fig. 23.

Fig. 15—6-GHz frequency diversity; 180-MHz separation.

Just as in the 6-GHz case when the frequency separation increases,
the diversity performance improves. This is described by increasing
values of $m_L$ for increasing frequency separation and will be discussed
in a later section.

Fig. 16—6-GHz frequency diversity; 210-MHz separation.

TABLE VI—6-GHz FREQUENCY DIVERSITY RESULTS

Figure    Frequency Separation (MHz)    Number of Pairs
10        30*                           3
11        60                            3
12        90                            3
13        120                           3
14        150                           1
15        180                           1
16        210                           1

* This is also the nominal bandwidth of the working channel.


VII. DESCRIPTION OF SIMULTANEOUS FADING AT DIFFERENT FREQUENCIES 


Multipath fading is caused by complicated interference phenomena 
and it is possible that various descriptions of simultaneous fading are 
useful. Models for fading can be postulated on two levels. First there 
is a mathematical (statistical) description of the characteristics of 
multipath fading. Second, on a more fundamental level, there is the 
model for the physical process that creates fading and from which the 
mathematical (statistical) model could be derived. At the present 
time there is no physical process model which gives results that agree 
well with the experimental data. On the other hand, a statistical model 
based on the joint Rayleigh probability distribution has been useful 
in the description of space diversity, and it is applied here (with con- 
siderable success) to frequency diversity. However the physical process 
model is still the ultimate goal and the experimental data and empirical 
formulas presented here should aid in attaining this goal. 

Fig. 17—4-GHz frequency diversity; 20-MHz separation.

The following discussion briefly gives the relevant details of the joint
Rayleigh distribution as applied to the data. For a Rayleigh variate,
the probability that the envelope voltage $R_1$ of the signal normalized
to its rms value has a value less than L is

$$ \Pr(R_1 < L) = 1 - \exp(-L^2). \tag{1} $$

Similarly the probability distribution of the envelope voltage $R_2$ of a
second signal normalized to the rms value of the first signal is

$$ \Pr(R_2 < L) = 1 - \exp(-L^2/v^2) \tag{2} $$

where

$$ v^2 = \langle R_2^2 \rangle_{av} / \langle R_1^2 \rangle_{av}. \tag{3} $$

The joint probability distribution function of the variables R, and Rz is’® 

$$ \Pr(R_1 < L,\; R_2 < L) = \int_0^{L^2/(1-k^2)} dX_1 \int_0^{(L/v)^2/(1-k^2)} dX_2 \; P(X_1, X_2) \tag{4} $$

with

$$ P(X_1, X_2) = (1 - k^2)\, I_0\!\left[2k (X_1 X_2)^{1/2}\right] \exp[-(X_1 + X_2)] $$

where $k^2$ is the correlation coefficient of $R_1^2$ and $R_2^2$. For use in this paper
$m_L$ has been defined as


mp, = 10°(1 — k’). (5) 
Fig. 18—4-GHz frequency diversity; 60-MHz separation.

Typical computed results are shown in Fig. 25 for $v^2 = 1$. For deep fades,
asymptotic forms of equations (2) and (4) are quite useful.


Pr(R, < L) = Lv” (6) 


and 
Pr(R, < L, R. < L) & (10°/mz)(L*/”). (7) 


Fig. 19—4-GHz frequency diversity; 80-MHz separation.

Fig. 20—4-GHz frequency diversity; 100-MHz separation.

The region of validity of equations (6) and (7) depends on v, L, and $m_L$.


For example it is the region in Fig. 25 where the lines are parallel to 
the mz = 10° line. 

Fig. 21—4-GHz frequency diversity; 140/160-MHz separation.

The joint Rayleigh distribution, calculated from equation (4), was
fitted to the diversity data points by overlaying plots of the joint
distribution for various values of $m_L$ and choosing the one with the best
apparent fit. The results of this are the bottom solid lines on the diversity
plots with the value of $m_L$ next to each line. In the fitting, somewhat
more weight was given to the values at 30-dB fade rather than at 40 dB
because of relative sample size. Also note that the curvature of the
joint Rayleigh fits the curvature of the data points for the smaller fade
values.
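
The behavior of the joint distribution can be explored numerically. The sketch below evaluates the double integral of equation (4) by the midpoint rule and compares it with the deep-fade limit obtained by setting the integrand equal to (1 − k²) over the small region of integration, which gives L⁴/[v²(1 − k²)]. It works directly in terms of k² rather than the fitted parameter, and the function names, grid size, and the value k² = 0.9 are illustrative assumptions. L is the normalized envelope voltage, so a 40-dB fade corresponds to L = 0.01.

```python
import math


def bessel_i0(x, terms=20):
    """Modified Bessel function I0 from its power series (adequate for the
    small arguments that arise here)."""
    total, term = 1.0, 1.0
    for n in range(1, terms):
        term *= (x / 2.0) ** 2 / (n * n)
        total += term
    return total


def joint_rayleigh(L, k2, v=1.0, n=100):
    """Midpoint-rule evaluation of the double integral in equation (4)."""
    k = math.sqrt(k2)
    upper_1 = L * L / (1.0 - k2)
    upper_2 = (L / v) ** 2 / (1.0 - k2)
    h1, h2 = upper_1 / n, upper_2 / n
    total = 0.0
    for i in range(n):
        x1 = (i + 0.5) * h1
        for j in range(n):
            x2 = (j + 0.5) * h2
            total += bessel_i0(2.0 * k * math.sqrt(x1 * x2)) * math.exp(-(x1 + x2))
    return (1.0 - k2) * total * h1 * h2


def joint_deep_fade_limit(L, k2, v=1.0):
    """Small-L limit of equation (4): L**4 / (v**2 * (1 - k**2))."""
    return L ** 4 / (v * v * (1.0 - k2))


# Uncorrelated case: the joint probability factors into two marginals.
independent = joint_rayleigh(0.5, 0.0)
product_of_marginals = (1.0 - math.exp(-0.25)) ** 2

# Deep fade (40 dB) with an illustrative correlation k**2 = 0.9.
ratio_40_db = joint_rayleigh(0.01, 0.9) / joint_deep_fade_limit(0.01, 0.9)
```

For zero correlation the integral reduces to the product of the two single-signal probabilities, which gives a simple check on the integration before it is applied to correlated cases.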


VIII. IMPROVEMENT 


The quantity of interest in any diversity scheme is the amount of
improvement relative to the nondiversity performance. Here this
performance measure is defined as the ratio of fractional outage of the
nondiversity signal to that of the diversity signal for a fixed fade depth.
Description by this factor (I) is convenient because it avoids detailed
description of the many schemes that are used to process the two signals.
The best of these switching or combining schemes will provide
performance equal to or somewhat better than that described by the fade
reduction factor.

Fig. 22—4-GHz frequency diversity; 220/240/260-MHz separation.

Fig. 23—4-GHz frequency diversity; 300/320/340-MHz separation.

Fig. 24—4-GHz frequency diversity; 400/420-MHz separation.

The fraction of the total time that a nondiversity signal is faded
depends on frequency, path length, terrain, antenna placement, and
climate. The last of these determines the fraction of the total time that
fading conditions exist on a given path. The periods used in analysis
were those for which fading conditions were in existence. Any change
in the total time of such fading periods would have no effect on the
statistics since they pertain to the fading phenomena and not to the
length of time (assuming an adequate number of samples are available).
However, the statistics have been normalized by adding in the remaining
or nonfading time. The effect of this or any like change in the amount
of the nonfading time is a uniform shift in the nondiversity and diversity
curves without changing their shape or their ratio; that is, the fractional
time scale is multiplied by a constant. This last fact has been heavily
utilized in the analysis where this ratio has been called the improvement
factor (I). Note that the improvement factor does not depend on how
often fading conditions exist but rather upon what happens within
these selective fading periods.
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Frequency Number 

Figure Separation (MHz) of Pairs 
17 20* 2 
18 60 2 
19 80 3 
20 100 2 
21 140 1 
160 2 
22 220 1 
240 1 
260 1 
23 300 1 
320 2 
340 1 
24 400 1 
420 1 





* This is also the nominal bandwidth of the working channel. 


Referring to the asymptotic forms for the joint Rayleigh model,
equations (6) and (7), the asymptotic form of the improvement factor
(I) for Rayleigh fading can be stated as

$$I = \frac{\Pr(R_1 < L)}{\Pr(R_1 < L,\ R_2 < L)} = \frac{m_R/10^4}{\Pr(R_1 < L)} \qquad (8)$$

where, for the time being, it is assumed that both signals have the same
rms value (that is, $\nu^2 = 1$).
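As a rough numerical illustration of equation (8), the sketch below evaluates the asymptotic improvement factor for an assumed value of $m_R$. It assumes the usual small-level Rayleigh approximation $\Pr(R_1 < L) \approx L^2$ for a unit-rms envelope; the function names and the sample value $m_R = 100$ are illustrative and are not taken from the paper.

```python
import math


def single_channel_outage(fade_depth_db: float) -> float:
    """Asymptotic Pr(R1 < L) for a unit-rms Rayleigh envelope (assumed ~ L^2)."""
    level = 10.0 ** (-fade_depth_db / 20.0)  # fade depth F = -20 log L
    return level ** 2


def rayleigh_improvement(m_r: float, fade_depth_db: float) -> float:
    """Improvement factor of equation (8): (m_R / 10^4) / Pr(R1 < L)."""
    return (m_r / 1.0e4) / single_channel_outage(fade_depth_db)


def joint_outage(m_r: float, fade_depth_db: float) -> float:
    """Pr(R1 < L, R2 < L) implied by equation (8): single outage divided by I."""
    return single_channel_outage(fade_depth_db) / rayleigh_improvement(m_r, fade_depth_db)


if __name__ == "__main__":
    # m_R = 100 is a hypothetical value chosen only to show the scaling with depth.
    for depth_db in (20.0, 30.0, 40.0):
        print(depth_db, rayleigh_improvement(100.0, depth_db))
```

Under these assumptions the improvement grows by a factor of ten for every additional 10 dB of fade depth, which is the $10^{F/10}$ behavior that appears in the fitted lines.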

The experimental improvement factors were obtained from the ratio
between the fitted diversity line and the nondiversity lines for the 6-GHz
and 4-GHz frequency pairs at a 40-dB fade depth. The values are plotted
on Fig. 26 versus the parameter Δf/f. Here f is taken as 3950 MHz for
the 4-GHz band and 6175 MHz for the 6-GHz band and Δf is the average
frequency separation for a grouping on a single figure, for example,
Δf = 240 MHz for Fig. 22. If the ±2-dB uncertainty were included,
the points plotted on Fig. 26 would change to vertical lines between
1.58 and 1/1.58 of the average value shown. Even with this large range
of uncertainty, it appears that the improvement and Δf/f are linearly
related as shown by the lines on the figure. The equations of the lines are


$$\begin{aligned}
\text{4 GHz:}\quad I &= \frac{1}{2}\left(\frac{\Delta f}{f}\right)10^{F/10} \\
\text{6 GHz:}\quad I &= \frac{1}{4}\left(\frac{\Delta f}{f}\right)10^{F/10}
\end{aligned} \qquad (9)$$

for $I \ge 10$, good accuracy; for $1 \le I \le 10$, less accurate but conservative;

where $F = -20 \log L$ is the fade depth in dB. This is the asymptotic
form of the formulas including the variation with fade depth as shown
in equation (8).
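The fitted lines of equation (9) are easy to evaluate directly. The following is a minimal sketch, using the center frequencies quoted in the text (3950 and 6175 MHz); the dictionary and function names are illustrative choices, not part of the original analysis.

```python
import math

# Slope coefficients of equation (9) and the band center frequencies from the text.
LINE_COEFF = {"4GHz": 0.5, "6GHz": 0.25}
CENTER_MHZ = {"4GHz": 3950.0, "6GHz": 6175.0}


def in_band_improvement(band: str, delta_f_mhz: float, fade_depth_db: float = 40.0) -> float:
    """Improvement factor I from equation (9) for an in-band diversity pair."""
    normalized_spacing = delta_f_mhz / CENTER_MHZ[band]
    return LINE_COEFF[band] * normalized_spacing * 10.0 ** (fade_depth_db / 10.0)


def uncertainty_band(value: float, uncertainty_db: float = 2.0) -> tuple:
    """Lower and upper limits when the stated dB uncertainty is applied as a ratio."""
    factor = 10.0 ** (uncertainty_db / 10.0)  # 2 dB corresponds to about 1.58
    return value / factor, value * factor


if __name__ == "__main__":
    nominal = in_band_improvement("4GHz", 240.0)  # grouping of Fig. 22
    print(nominal, uncertainty_band(nominal))
```

For the 240-MHz grouping at 4 GHz and a 40-dB fade, this gives an improvement of roughly 300, with the 2-dB uncertainty spanning a factor of about 1.58 either way.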

Using equation (8) as a guide, it is conjectured that the experimental
improvement can be separated into two parts which contain respectively
the nondiversity fading and the frequency diversity effect, that is,

$$I = \frac{m/10^4}{P(L)} \qquad (10)$$

where P(L) is the measured probability that a nondiversity fade exceeds
-20 log L dB and m is a frequency diversity parameter. Of course both
these quantities are functions of frequency, path geometry, terrain,
and antenna placement.

Fig. 25: Probability that both signals are simultaneously less than a given
amount (Joint Rayleigh Distribution).

Consider first the variation of m with Δf/f and secondly the difference
in improvement between the 4-GHz and 6-GHz bands.

The nondiversity results [P(L)] can be written as (see Fig. 9)

$$\begin{aligned}
\text{6 GHz:}\quad P_6 &= 0.53\,L^2 \\
\text{4 GHz:}\quad P_4 &= 0.25\,L^2
\end{aligned} \qquad (11)$$

where $F = -20 \log L$ is the fade depth in dB. Then from equations (9)
and (10)

$$\begin{aligned}
\text{6 GHz:}\quad m_6 &= (10^4)\left(\frac{1}{4}\right)\left(\frac{\Delta f}{f}\right)10^{F/10}\,(0.53)\,L^2 = 1325\,\frac{\Delta f}{f} \\
\text{4 GHz:}\quad m_4 &= (10^4)\left(\frac{1}{2}\right)\left(\frac{\Delta f}{f}\right)10^{F/10}\,(0.25)\,L^2 = 1250\,\frac{\Delta f}{f}
\end{aligned} \qquad (12)$$

The difference between $m_4$ and $m_6$ is small. Thus, as desired, m depends
primarily upon normalized frequency spacing and not upon either the
nondiversity fading distribution or the radio frequency band (4 vs 6 GHz).
For further use it is assumed that m = 1300 Δf/f.

Fig. 26: 1966 West Unity in-band frequency diversity improvement ratio at
40-dB Fade Depth (L = 0.01).

Using equations (9) and (10) again and forming a ratio gives

$$\frac{I_{4\,\mathrm{GHz}}}{I_{6\,\mathrm{GHz}}} = \frac{P_6}{P_4} \qquad (13)$$

$$= 2.1 \qquad (14)$$


which agrees very closely with the experimental ratio of 2 shown on 
Fig. 26. Thus equation (10) correctly predicts the relative improvement 
between the 6- and 4-GHz bands. Further this relative improvement 
depends upon the nondiversity fading results and not upon the normalized 
frequency spacing. 
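The arithmetic behind equations (12) through (14) can be checked in a few lines. This sketch simply combines the coefficients of equations (9) and (11), using the fact that $10^{F/10} L^2 = 1$ when $F = -20 \log L$; the names below are illustrative.

```python
import math

I_COEFF = {"4GHz": 0.5, "6GHz": 0.25}   # slopes of the fitted lines, equation (9)
P_COEFF = {"4GHz": 0.25, "6GHz": 0.53}  # nondiversity outage coefficients, equation (11)


def m_per_unit_spacing(band: str) -> float:
    """m divided by (delta f / f), from m = 10^4 * I * P(L) per equation (10)."""
    # The fade-depth factors cancel because 10^(F/10) * L^2 = 1.
    return 1.0e4 * I_COEFF[band] * P_COEFF[band]


def improvement_ratio_4_to_6() -> float:
    """Ratio I(4 GHz) / I(6 GHz) at equal normalized spacing, equal to P6 / P4."""
    return P_COEFF["6GHz"] / P_COEFF["4GHz"]


if __name__ == "__main__":
    print(m_per_unit_spacing("4GHz"), m_per_unit_spacing("6GHz"))
    print(improvement_ratio_4_to_6())
```

The two band values of m differ by only about 6 percent, and the ratio of improvements comes out near 2.1.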

To recapitulate, the asymptotic value of improvement of an in-band
frequency diversity pair relative to the nondiversity signal at a fade
depth of -20 log L dB can be stated for the experimental data as

$$I = \frac{0.13\,\Delta f/f}{P(L)} \qquad (15)$$

where P(L) is the probability that the nondiversity signal fades below
the given depth. In this formula, I is not affected by the relative amount
of time that fading conditions do or do not exist. However both the
numerator and denominator in equation (15) would change by the
same multiplicative constant when the ratio of nonfading to fading
time changes. Thus the terms P(L) and 0.13 Δf/f individually apply
only to the experimental path but their ratio is more generally useful.
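The invariance described above can be illustrated numerically. The sketch below evaluates equation (15) and then rescales both the numerator and the nondiversity probability by the same constant, as would happen if the proportion of nonfading time changed. The sample spacing (5 percent) and the scale factor are assumed values chosen only for illustration.

```python
import math


def improvement(delta_f_over_f: float, p_nondiversity: float, time_scale: float = 1.0) -> float:
    """Equation (15), with an optional common scale applied to both terms."""
    numerator = 0.13 * delta_f_over_f * time_scale
    denominator = p_nondiversity * time_scale
    return numerator / denominator


if __name__ == "__main__":
    level = 0.01                       # L for a 40-dB fade
    p_4ghz = 0.25 * level ** 2         # nondiversity outage from equation (11)
    print(improvement(0.05, p_4ghz))            # as measured
    print(improvement(0.05, p_4ghz, 0.5))       # half as much fading time, same ratio
```

Both calls give the same improvement of about 260, since the common factor cancels in the ratio.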

This ratio (I) characterizes frequency diversity during multipath
fading periods. Although I was obtained from experimental data on
one path, it should pertain to other paths of about the same length
but having different terrain and climate. The terrain and climate play
a major role in determining the fraction of time that multipath fading
conditions exist but they probably will have only a secondary effect
on the relation between a nondiversity signal and a diversity signal
within a multipath fading period.


IX. CROSSBAND FREQUENCY DIVERSITY


Results were also obtained for a subset of the 4-GHz and 6-GHz 
channels where the diversity pair consists of one channel from each group. 
The channels used for analysis were 4-2, 4-1, 4-7, 4-6 and 6-11, 6-15, 6-18. 
The results are given in Figs. 27 through 30. The groupings for each 
figure are for one of the 4-GHz channels in diversity with each of the 
6-GHz channels. As before, there are several curves on each figure. The 
two uppermost are the average nondiversity results for each band with 
the 6 GHz being 3.3 dB poorer than the 4 GHz for a fixed probability.

Fig. 27: 4/6-GHz crossband frequency diversity.

Fig. 28: 4/6-GHz crossband frequency diversity.

Again the joint Rayleigh distribution was fitted to the data by
overlaying plots of the joint distribution for various values of $m_R$. In this
case the rms values are unequal by an amount

$$-10 \log \nu^2 = 3.3\ \text{dB} \quad \text{or} \quad \nu^2 = 0.47. \qquad (16)$$


The asymptotic form of the improvement factor I between the diversity
curve and the top nondiversity curve (6 GHz) is given as in equation (8)
by

$$I_{\max} = \frac{m_R/10^4}{\Pr(R_1 < L)} \qquad (17)$$

This corresponds to the improvement obtained if a 4-GHz channel were
used to protect a 6-GHz channel.

The asymptotic ratio between the bottom nondiversity curve (4
GHz) and the diversity curve is then

$$I_{\min} = \nu^2\,\frac{m_R/10^4}{\Pr(R_1 < L)} = \nu^2 I_{\max} \qquad (18)$$

which corresponds to the improvement obtained if a 6-GHz channel were
used to protect a 4-GHz channel. In these formulas, $m_R$ and $\Pr(R_1 < L)$
are Rayleigh quantities with $m_R$ related to the correlation coefficient and
-20 log L equal to the fade depth in dB exceeded by the envelope voltage
$R_1$.

Fig. 29: 4/6-GHz crossband frequency diversity.
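The relation between the two crossband improvement values is a one-line calculation. The sketch below converts the 3.3-dB rms offset of equation (16) to $\nu^2$ and applies equation (18); the sample values of $I_{\max}$ passed in are illustrative inputs, and the function names are not from the paper.

```python
import math


def nu_squared_from_db(offset_db: float) -> float:
    """Solve -10 log10(nu^2) = offset_db for nu^2, as in equation (16)."""
    return 10.0 ** (-offset_db / 10.0)


def minimum_improvement(i_max: float, nu_squared: float) -> float:
    """Equation (18): I_min = nu^2 * I_max."""
    return nu_squared * i_max


if __name__ == "__main__":
    nu2 = nu_squared_from_db(3.3)
    print(round(nu2, 2))
    # Illustrative I_max values, evaluated with the rounded nu^2 = 0.5.
    for i_max in (250.0, 200.0, 150.0, 500.0):
        print(i_max, minimum_improvement(i_max, 0.5))
```

A 3.3-dB offset corresponds to $\nu^2 \approx 0.47$, so protecting the better (4-GHz) channel with the poorer one yields roughly half the improvement obtained in the opposite arrangement.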

Inspection of the results shows that the points have more scatter than 
the 6-GHz in-band diversity data and just about the same scatter as the 
4-GHz in-band diversity data, that is, the fitted line is a good repre- 
sentation of the data from 20 to 30 dB with increasing divergence for 
greater fade depths. 

As to quantitative interpretation, the results do not appear to be as
yielding to analysis as the in-band diversity. This may result because the
frequency spacings are a significant percentage of the average center
frequency, for example, spacings of 1775 MHz (4-6/6-11) to 2405 MHz
(4-7/6-18). The improvement values obtained from the fitted lines on
the figures at 40-dB fade depth are presented in Table VIII.

Fig. 30: 4/6-GHz crossband frequency diversity.

The results do not show a consistent behavior as a function of frequency 
separation. 4-7, which has the largest frequency separation, shows slightly 
more improvement than 4-1 or 4-2 but less than 4-6 which has the smallest 
frequency separation. However, 4-2, 4-1, 4-7 are tightly bunched in 
frequency whereas 4-6 is about 300 MHz closer to the 6-GHz band. 

In any case, these results are comparable to the in-band diversity re- 
sults, that is, the improvement from crossband diversity was not sig- 
nificantly better than in-band diversity of two percent or more separation. 
Thus there may be a saturation effect which will appear for frequency 
separations above say 10 percent. There is neither enough data nor a 
theory to prove or disprove such speculation. 


X. CROSS ROUTE DIVERSITY


Diversity results were obtained for various 4-GHz and 6-GHz channels 
on the Pleasant Lake hop in diversity with the single 4-GHz channel 
measured on the Paulding hop. The previous data strongly implies that 
it may be very misleading to rely on the results for a single channel. 
However, this data is included for completeness. To review: the Paulding 
data is for a different path but for the same time periods. One would there- 
fore expect the diversity performance to be very good since the signals 
from the pair of paths should be reasonably independent. However this 
did not appear to be the case. 

The data are shown in Figs. 31 and 32 in the groupings presented in
Table IX. The lines on the figure have exactly the same meaning as the
corresponding ones in the crossband section. In this case the 6-GHz fit
is good but the 4-GHz fit is poor below 30 dB in that the data has an
upward bulge. There is no explanation available for this anomaly.


TABLE VIII: CROSSBAND IMPROVEMENT VALUES

Figure     I_max    I_min*
Fig. 27    250      125
Fig. 28    200      100
Fig. 29    150      75
Fig. 30    500      250

* I_min = ν² I_max, with ν² = 0.47 from equation (16), but 0.5 has been
used in this table for convenience.

Fig. 31: 4/6-GHz cross route diversity.

In any case, the improvement obtained when the two channels in the 
diversity pair are on different hops is not significantly better than in-band 
diversity (see Fig. 26). This is surprising and raises questions about the 
correlation between fading on adjacent hops, for example, the maximum 
possible diversity improvement may be limited to values less than that 
expected from independent fading. 

To repeat, this is based on a single channel and as such the data base
is simply not sufficient to draw any profound conclusions about
cross-route effects.

Fig. 32: 4-GHz cross route diversity.


TABLE IX: CROSS-ROUTE RESULTS

4/4-PA, ν² = 0.322 (used 0.31)
6/4-PA, ν² = 0.715 (used 0.75)

Figure   Diversity Pairs       I_max   I_min*
31       6-11 with 4-6 PA      400     300
         6-18 with 4-6 PA
32       4-7 with 4-6 PA       800     250
         4-6 with 4-6 PA

* I_min = ν² I_max


XI. COMPARISON OF FREQUENCY AND SPACE DIVERSITY IMPROVEMENT


The empirical results for space diversity given in Vigants’® can be 
compared with those obtained here for frequency diversity. The improve- 
ment factor for space diversity is 


$$I_{SD} = \frac{s^2}{2.75\,D\,\lambda}\,10^{F/10} \qquad (19)$$

where

s is vertical separation between equal antennas in feet,
D is path length in feet,
λ is wavelength in feet, and
F is fade depth in dB.


Using D = 28.5 miles and equation (9) gives the various diversity 
improvement factors as presented in Table X. 

These are plotted in Fig. 33 for a fade depth of 40 dB. Several points 
are immediate. First the improvement increases with frequency for space 
diversity and decreases with frequency for frequency diversity; that is, 
space diversity becomes relatively more effective as the operating fre- 
quency increases. The maximum improvement for frequency diversity 
is 100 for the maximum allowable spacing of 4 percent in the standard 
6-GHz frequency plans. Space diversity of 26 feet will give this improve- 
ment. Since this spacing is reasonable, it can be said that space diversity 
is "better" than frequency diversity at 6 GHz. At 4 GHz, the
corresponding values are I = 625 for 12.5 percent and 79-foot spacings. In this case,
frequency and space diversity are comparable in performance.
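The comparison above can be reproduced approximately from equations (9) and (19). The sketch below is a minimal check under stated assumptions: the speed of light is taken as about 983.57 million feet per second to convert frequency to wavelength in feet, and the function names are illustrative. Because the text rounds the space diversity coefficient to a multiple of $10^{-5}$, the exact evaluation differs from the quoted figures by a few percent.

```python
import math

FEET_PER_MILE = 5280.0
LIGHT_SPEED_FT_PER_S = 299_792_458.0 / 0.3048  # about 9.8357e8 ft/s


def wavelength_ft(frequency_mhz: float) -> float:
    """Free-space wavelength in feet."""
    return LIGHT_SPEED_FT_PER_S / (frequency_mhz * 1.0e6)


def space_improvement(separation_ft: float, path_miles: float,
                      frequency_mhz: float, fade_depth_db: float) -> float:
    """Space diversity improvement, equation (19)."""
    path_ft = path_miles * FEET_PER_MILE
    denominator = 2.75 * path_ft * wavelength_ft(frequency_mhz)
    return separation_ft ** 2 / denominator * 10.0 ** (fade_depth_db / 10.0)


def frequency_improvement(line_coeff: float, spacing_fraction: float,
                          fade_depth_db: float) -> float:
    """Frequency diversity improvement, equation (9); line_coeff is 0.5 or 0.25."""
    return line_coeff * spacing_fraction * 10.0 ** (fade_depth_db / 10.0)


if __name__ == "__main__":
    print(frequency_improvement(0.25, 0.04, 40.0), space_improvement(26.0, 28.5, 6175.0, 40.0))
    print(frequency_improvement(0.5, 0.125, 40.0), space_improvement(79.0, 28.5, 3950.0, 40.0))
```

With a 28.5-mile path and a 40-dB fade, a 26-foot antenna separation at 6 GHz gives an improvement close to the 100 obtained from 4 percent frequency spacing, and a 79-foot separation at 4 GHz comes within a few percent of the 625 obtained from 12.5 percent spacing.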

These comparisons have been made only for one-for-one space and 
frequency diversity on a single hop; additional data and studies are 
needed to clarify our understanding. 


TABLE X: DIVERSITY IMPROVEMENT FACTORS

             4 GHz                    6 GHz
Frequency    0.5(Δf/f)10^(F/10)       0.25(Δf/f)10^(F/10)
Space        (s²/10^5)10^(F/10)       1.5(s²/10^5)10^(F/10)

Fig. 33: Comparison of space and frequency diversity at a 40-dB fade depth.


Computed Transmission Through Rain at
Microwave and Visible Frequencies


By DAVID E. SETZER 


(Manuscript received May 5, 1970) 


In this paper we present tables which contain the Mie scattering
coefficient, absorption coefficient, extinction coefficient, equivalent medium index
of refraction and phase delay for rains conforming to the Laws and Parsons
drop-size distribution. These transmission characteristics have been
calculated for microwave frequencies of interest in common carrier radio relay
systems, 300 to 1.43 GHz, that is, 0.1 to 21.0 cm, at rain rates from 0.25
to 150.0 mm/hr. We also include the extinction coefficients for the visible
wavelength 0.6328 μ.

The microwave tables were generated by using a Mie scattering computer
program similar to that designed and previously reported by Deirmendjian.
The calculations at 0.6328 μ were made separately by employing the usual
assumptions for droplets with very large circumference to wavelength ratios.


I. INTRODUCTION 


The Mie extinction properties are of basic importance to those
interested in developing an understanding of the influence of rainfall
on open air communication systems. In this connection we have
generated a rather extensive set of tables of extinction properties of rain.
The tables have been used within Bell Laboratories to study a variety
of transmission problems, examples of which are the investigation of
satellite ground station interference by Gusler and Hogg (1970),* the
study of microwave transit time variations by Gray (1970), Pierce's
(1969) investigation of the problems associated with the synchronization
of digital networks and Setzer's (1969) study of the extinction properties
of atmospheric aerosols. A set of tables with similar results was
published by Medhurst (1965); however, his presentation only includes
total attenuation. Our tables include the Mie scattering coefficient,
absorption coefficient, extinction coefficient, the van de Hulst equivalent
medium index of refraction and the van de Hulst phase delay for rains
conforming to the Laws and Parsons drop-size distribution. These
transmission characteristics have been calculated for incident
microwave wavelengths of 0.1, 0.2, 0.3, 0.5, 1.0, 1.62, 1.88, 2.73, 5.0, 7.5, 10.0,
15.0 and 21.0 cm (corresponding to 300, 150, 100, 60, 30, 18.5, 16, 11,
6, 4, 3, 2 and 1.43 GHz) at rain rates of 0.25, 1.25, 2.5, 5.0, 12.5, 50.0,
100.0 and 150.0 mm/hr. Also included are the extinction coefficients
for the visible wavelength 0.6328 μ at the above rain rates.

* The attenuation constants used by Gusler and Hogg were based on empirical
data. The calculated values presented in this paper were used for comparison purposes
only.

The calculations in the microwave region were performed on a
GE 635 computer using a scattering program similar to that previously
presented by Deirmendjian (1963). Since the raindrop
circumference-to-wavelength ratio (πd/λ), that is, size parameter, for the visible
wavelength, is outside the range of validity of the computer program,
approximate characteristics were calculated for 0.6328 μ. The usual
assumptions for spheres with very large parameters were employed.

The indices of refraction used in this report and shown in Table I 
are for a rain temperature of 20°C. They were obtained by cross check- 
ing many of the standard optical and microwave references and are 
thought to be reliable. 


II. DROPLET SIZE DISTRIBUTION 


All computations in this paper are based on the assumption that
raindrops are spherical and the distribution of rain is as was measured
by Laws and Parsons and quoted by Kerr (1951). The Laws and
Parsons distribution is presented in Table II as the percentage of
total water volume within specific size ranges. In order to use the
computer program, it is necessary to express the distribution in terms
of the number of droplets per unit volume within specific size ranges.
If the droplets are assumed to fall at the terminal velocity $V_0$, that is,
up and down drafts are neglected, then the conversion is

$$D(d_{i+1}, d_i) \approx R_t\,P(d_{i+1}, d_i) / [V(\bar{d})\,V_0(\bar{d})], \qquad (1)$$

where $D(d_{i+1}, d_i)$ represents the size distribution in units of droplets
per unit volume in the droplet diameter range $d_{i+1}$ to $d_i$. Henceforth,
the diameter range $d_{i+1}$ to $d_i$ will be called $\Delta d_i$. $R_t$ is the total rainfall
rate which is typically specified in mm/hr; $P(d_{i+1}, d_i)$ is the volume
percentage rainfall in the diameter range $\Delta d_i$ as measured by Laws
and Parsons; $\bar{d}$ is the average diameter in the range $\Delta d_i$; and $V(\bar{d})$
is the volume of a sphere of diameter $\bar{d}$. The terminal velocities of
raindrops $V_0(\bar{d})$ are presented in Table III.

For an example of the function $D(d_{i+1}, d_i)$ resulting from the use of
equation (1), refer to Fig. 1.
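The conversion of equation (1) mainly requires care with units. The sketch below is one way to carry it out in SI units, converting the rain rate from mm/hr to m/s and the volume percentage to a fraction. The terminal velocity used in the example (about 8 m/s for a 0.3-cm drop) is an assumed illustrative value and is not taken from Table III; the function name is likewise hypothetical.

```python
import math


def drops_per_cubic_meter(rain_rate_mm_hr: float, volume_percent: float,
                          mean_diameter_cm: float, terminal_velocity_m_s: float) -> float:
    """Number density of drops in one size range, following equation (1)."""
    rain_flux_m_s = rain_rate_mm_hr / 1000.0 / 3600.0   # water depth per second
    volume_fraction = volume_percent / 100.0            # share of water volume in this range
    diameter_m = mean_diameter_cm / 100.0
    drop_volume_m3 = math.pi * diameter_m ** 3 / 6.0    # V(d) for a sphere
    return rain_flux_m_s * volume_fraction / (drop_volume_m3 * terminal_velocity_m_s)


if __name__ == "__main__":
    # 150 mm/hr rain, 17.7 percent of the volume near 0.3 cm diameter,
    # and an assumed fall speed of 8.06 m/s for drops of that size.
    print(drops_per_cubic_meter(150.0, 17.7, 0.3, 8.06))
```

With these inputs the result is on the order of 65 drops per cubic meter in that size range; the value scales inversely with whatever terminal velocity is actually used.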


III. TRANSMISSION PARAMETERS FOR MICROWAVE FREQUENCIES 


The Mie coefficients and the equivalent index of refraction of the
rain medium are defined by van de Hulst (1957). For a detailed
description of these parameters, please refer to his work. Essentially,
the scattering coefficient $\beta_{scat}(\lambda)$ and the absorption coefficient $\beta_{abs}(\lambda)$
are measures of the total energy scattered and absorbed by a unit
volume of rainfall. In the simple case of a single scattering aerosol
the ratio of intensity of the transmitted beam $I_T(\lambda)$ to that of the
incident beam $I_0(\lambda)$ is

$$I_T(\lambda)/I_0(\lambda) = \exp[-\beta_{ext}(\lambda) \cdot l], \qquad (2)$$

where l is the length of the propagation path through the rain and the
extinction coefficient $\beta_{ext}(\lambda)$ is

$$\beta_{ext}(\lambda) = \beta_{scat}(\lambda) + \beta_{abs}(\lambda). \qquad (3)$$
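Equations (2) and (3) translate directly into a transmission fraction or a path loss in dB. The following minimal sketch uses the scattering and absorption coefficients listed for 1.0 cm wavelength at 25 mm/hr (0.4939 and 0.6890 per km); the function names and the 1-km path are illustrative assumptions.

```python
import math


def extinction_coefficient(beta_scat: float, beta_abs: float) -> float:
    """Equation (3): extinction is the sum of scattering and absorption."""
    return beta_scat + beta_abs


def transmitted_fraction(beta_ext_per_km: float, path_km: float) -> float:
    """Equation (2): intensity ratio for a single scattering medium."""
    return math.exp(-beta_ext_per_km * path_km)


def attenuation_db(beta_ext_per_km: float, path_km: float) -> float:
    """The same loss expressed in dB: 10 log10(e) times beta_ext times path length."""
    return 10.0 * math.log10(math.e) * beta_ext_per_km * path_km


if __name__ == "__main__":
    beta_ext = extinction_coefficient(0.4939, 0.6890)
    print(beta_ext, transmitted_fraction(beta_ext, 1.0), attenuation_db(beta_ext, 1.0))
```

For this case about 31 percent of the incident intensity survives a 1-km path, which corresponds to a loss of roughly 5.1 dB.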


A plane-parallel medium containing many scattering particles can
be represented by a slab of homogeneous material having a complex
refractive index $\tilde{m}$. Carefully note that this sort of representation can
be designed, by selecting the appropriate value of $\tilde{m}$, to preserve the
input-output relationships but all detail of the scattering process
within the medium is lost. According to the van de Hulst (1957)
definition of $\tilde{m}$, the amplitude and phase of the incident wave are changed
by the slab in the proportion

$$\exp[-(2\pi l/\lambda)\,\mathrm{Im}(1 - \tilde{m})] \cdot \exp[-(2\pi l i/\lambda)\,\mathrm{Re}(\tilde{m} - 1)], \qquad (4)$$

where the first term is recognized as defining the amplitude ratio and
the second the phase. The values of $\beta_{ext}$, $\beta_{abs}$, $\beta_{scat}$, $\tilde{m}$ and the phase
angle described above have been calculated for the specified microwave
frequencies. The results appear in Tables IV through XVI. The reader
is advised to use special care when attempting to apply the van de Hulst
phase angle and medium index $\tilde{m}$. It is recommended that van de Hulst's
derivation be studied carefully so that the meaning and limitations of
these functions are well understood. For example, light reflected from
the slab cannot be derived by using the refractive index $\tilde{m}$, but should
be computed by means of the actual scattering functions.

Fig. 1: Laws and Parsons drop-size distribution for 150 mm/hr rain.

Also, it should be noted that although $\tilde{m}$ is calculated, $(\tilde{m} - 1)$ is
used to determine the phase angle. Since $\tilde{m}$ is very close to one,
cancellation of the leading terms reduces the significant places in the numerical
value of the phase angle to one or two at most. Consequently the values
given in the phase change column of Tables IV through XVI exhibit
noticeable discontinuous jumps.
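The discontinuous jumps follow from the phase term of equation (4) once $\mathrm{Re}(\tilde{m} - 1)$ is retained to only one decimal place in units of $10^{-6}$. The sketch below computes the magnitude of the phase change per kilometer, $360\,\mathrm{Re}(\tilde{m} - 1)/\lambda$ degrees, and the size of the step caused by that rounding. The function names are illustrative, and the one-decimal resolution is an assumption inferred from how the tabulated values appear.

```python
import math

CM_PER_KM = 1.0e5


def phase_change_deg_per_km(re_m_minus_1: float, wavelength_cm: float) -> float:
    """Magnitude of the phase term in equation (4) over a 1-km path, in degrees."""
    wavelengths_per_km = CM_PER_KM / wavelength_cm
    return 360.0 * re_m_minus_1 * wavelengths_per_km


def phase_step_deg_per_km(wavelength_cm: float, resolution: float = 0.1e-6) -> float:
    """Smallest phase increment when Re(m - 1) is known only to the given resolution."""
    return phase_change_deg_per_km(resolution, wavelength_cm)


if __name__ == "__main__":
    for wavelength_cm in (0.2, 0.3, 0.5, 1.0):
        print(wavelength_cm, phase_step_deg_per_km(wavelength_cm))
```

At 0.2 cm wavelength the step is 18 degrees per kilometer and at 0.3 cm it is 12 degrees per kilometer, so neighboring rain rates can share a phase value or differ by a whole step.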


IV. TRANSMISSION PARAMETERS FOR 0.6328 μ


The Mie coefficients $\beta_i$ are defined as

$$\beta_i(\lambda) = \int_0^\infty \gamma_i(\lambda, r)\,n(r)\,dr, \qquad i = 1, 2, 3, \qquad (5)$$

where r is the droplet radius; n(r) is the continuous size distribution,
and $\gamma_i(\lambda, r)$, i = 1, 2, 3 are the extinction, scattering and absorption
cross sections, respectively, for droplets of radius r. The smallest ratio
of raindrop circumference to wavelength for the combination of a Laws
and Parsons rain and 0.6328 μ is approximately 1500. For most purposes,
the laws of geometric optics can be applied in such cases and therefore

$$\gamma_{ext}(\lambda, r) \approx 2\pi r^2. \qquad (6)$$


Also, since the index of refraction of water at 0.6328 μ is a real number,
1.33, the absorption coefficient will be zero. It follows from equations
(3), (5) and (6) that

$$\beta_{ext}(0.6328\,\mu) = \beta_{scat}(0.6328\,\mu), \qquad (7)$$

$$\approx 2\pi \int_0^\infty r^2\,n(r)\,dr. \qquad (8)$$

This expression and the Laws and Parsons distribution were used to
generate Table XVII. In this connection the Laws and Parsons
distribution $D(d_{i+1}, d_i)$ was used to approximate the continuous function
n(r).
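When the continuous function n(r) is replaced by counts in discrete size ranges, the integral of equation (8) becomes a sum. The sketch below shows that sum together with the size parameter $\pi d/\lambda$ used to justify the geometric optics limit. The radii, drop counts, and the 0.03-cm diameter in the example are hypothetical values chosen only to exercise the functions.

```python
import math


def size_parameter(diameter_cm: float, wavelength_cm: float) -> float:
    """Ratio of drop circumference to wavelength, pi * d / lambda."""
    return math.pi * diameter_cm / wavelength_cm


def geometric_extinction_per_km(radii_m, drops_per_m3) -> float:
    """Discrete form of equation (8): 2 * pi * sum(r^2 * N), converted to km^-1."""
    total_area = sum(r ** 2 * n for r, n in zip(radii_m, drops_per_m3))
    return 2.0 * math.pi * total_area * 1000.0


if __name__ == "__main__":
    wavelength_cm = 0.6328e-4                     # 0.6328 micron expressed in cm
    print(size_parameter(0.03, wavelength_cm))    # hypothetical small drop
    # Two hypothetical size ranges: 0.5-mm and 1.0-mm radius drops.
    print(geometric_extinction_per_km([0.5e-3, 1.0e-3], [1000.0, 100.0]))
```

Each drop contributes twice its geometric cross section, so the extinction coefficient depends only on the total projected area of the drops per unit volume.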


V. GRAPHICAL REPRESENTATION 


For the purpose of illustration, a graph of extinction coefficients 
versus total water content and rain rate is included (see Fig. 2). Not 
all wavelengths are represented because some of the curves are too 
closely grouped in the neighborhood of those shown. Those that were 
excluded, were excluded for reasons of clarity only. One point of some 
interest is the location of the attenuation curve for 0.6328 μ in Fig. 2.
Note that it represents a reversal of the trend exhibited as wavelength 


Fig. 2: Rainfall water content and rain rate versus extinction coefficient β_ext
(km^-1). Curves are labeled by wavelength (0.1 cm, 0.6328 μ, 1.00 cm, and
others); the abscissa is water content in gm/cc.


TABLE I: INDEX OF REFRACTION OF WATER AT 20°C

Wavelength (cm)    Index of Refraction
0.10               2.587 - 0.937i
0.20               3.039 - 1.575i
0.30               3.505 - 2.007i
0.50               4.364 - 2.521i
1.00               5.900 - 2.900i
1.62               7.001 - 2.544i
1.88               7.500 - 2.500i
2.73               8.070 - 1.990i
5.00               8.670 - 1.202i
7.50               8.770 - 0.915i
10.00              8.871 - 0.628i
15.00              8.916 - 0.422i
21.00              9.000 - 0.275i
0.6328 μ           1.33 - 0.0i


TABLE II: LAWS AND PARSONS DROP-SIZE DISTRIBUTIONS FOR
VARIOUS PRECIPITATION RATES
(percent of total water volume)

Drop                          Rain Rate (mm/hour)
Diameter (cm)   0.25   1.25   2.5    5      12.5   25     50     100    150
0.05            28.0   10.9   7.3    4.7    2.6    1.7    1.2    1.0    1.0
0.1             50.1   37.1   27.8   20.3   11.5   7.6    5.4    4.6    4.1
0.15            18.2   31.3   32.8   31.0   24.5   18.4   12.5   8.8    7.6
0.2             3.0    13.5   19.0   22.2   25.4   23.9   19.9   13.9   11.7
0.25            0.7    4.9    7.9    11.8   17.3   19.9   20.9   17.1   13.9
0.3                    1.5    3.3    5.7    10.1   12.8   15.6   18.4   17.7
0.35                   0.6    1.1    2.5    4.3    8.2    10.9   15.0   16.1
0.4                    0.2    0.6    1.0    2.3    3.5    6.7    9.0    11.9
0.45                          0.2    0.6    1.2    2.1    3.3    5.8    7.7
0.5                                  0.3    0.6    1.1    1.8    3.0    3.6
0.55                                        0.2    0.5    1.1    1.7    2.2
0.6                                                0.3    0.5    1.0    1.2
0.65                                                      0.2    0.7    1.0
0.7                                                                     0.3





TABLE III: RAINDROP TERMINAL VELOCITY

Radius, cm      Velocity, m/sec


TABLE IV: MIE EXTINCTION PARAMETERS AT 0.1 cm WAVELENGTH (300 GHz), H2O INDEX OF REFRACTION
2.587 - 0.937i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.05390        0.05878        0.1127          0.0            0.1051         0.0
1.25        0.1705         0.1723         0.3428          0.0            0.3051         0.0
2.5         0.2760         0.2693         0.5452          0.0            0.4763         0.0
5.0         0.4550         0.4306         0.8856          0.0            0.7630         0.0
12.5        0.8913         0.8133         1.705           0.0            1.451          0.0
25.0        1.452          1.284          2.736           0.0            2.305          0.0
50.0        2.270          1.914          4.187           0.0            3.471          0.0
100.0       3.993          3.354          7.347           0.0            6.109          0.0
150.0       5.636          4.730          10.37           0.0            8.636          0.0

TABLE V: MIE EXTINCTION PARAMETERS AT 0.2 cm WAVELENGTH (150 GHz), H2O INDEX OF REFRACTION
3.039 - 1.575i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.05581        0.05445        0.1103          0.0            0.2276         0.0
1.25        0.1828         0.1685         0.3514          0.0            0.6657         0.0
2.5         0.2991         0.2686         0.5677          0.0            1.042          0.0
5.0         0.4965         0.4349         0.9314          0.0            1.671          0.0
12.5        0.9766         0.8283         1.805           0.0            3.176          0.0
25.0        1.596          1.315          2.911           0.0            5.044          0.0
50.0        2.512          1.988          4.500           0.0            7.592          0.0
100.0       4.406          3.449          7.856           0.1            13.33          18.0
150.0       6.212          4.846          11.06           0.1            18.838         18.0

TABLE VI: MIE EXTINCTION PARAMETERS AT 0.3 cm WAVELENGTH (100 GHz), H2O INDEX OF REFRACTION
3.505 - 2.007i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.04252        0.04991        0.09243         0.0            0.2886         0.0
1.25        0.1544         0.1586         0.3130          0.0            0.8927         0.0
2.5         0.2634         0.2555         0.5189          0.1            1.482          12.0
5.0         0.4520         0.4175         0.8695          0.1            2.3842         12.0
12.5        0.9211         0.8026         1.723           0.1            4.542          12.0
25.0        1.542          1.283          2.825           0.2            7.320          24.0
50.0        2.502          1.957          4.459           0.2            11.27          24.0
100.0       4.382          3.389          7.770           0.4            19.72          48.0
150.0       6.153          4.751          10.90           0.6            27.76          72.0

TABLE VII: MIE EXTINCTION PARAMETERS AT 0.5 cm WAVELENGTH (60 GHz), H2O INDEX OF REFRACTION
4.364 - 2.521i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.01638        0.02856        0.04493         0.0            0.2281         0.0
1.25        0.08590        0.1085         0.1945          0.1            0.9094         7.2
2.5         0.1667         0.1876         0.3544          0.1            1.608          7.2
5.0         0.3157         0.3236         0.6393          0.2            2.832          14.4
12.5        0.7145         0.6574         1.372           0.4            5.924          28.8
25.0        1.279          1.089          2.368           0.6            10.06          43.3
50.0        2.233          1.743          3.977           0.8            16.59          57.6
100.0       3.934          2.999          6.933           1.3            28.89          93.6
150.0       5.505          4.165          9.670           1.8            41.65          129.6

TABLE VIII: MIE EXTINCTION PARAMETERS AT 1.0 cm WAVELENGTH (30 GHz), H2O INDEX OF REFRACTION
5.9 - 2.9i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.001459       0.009006       0.01046         0.0            0.01046        0.0
1.25        0.01303        0.04392        0.05695         0.1            0.5248         3.6
2.5         0.03112        0.08465        0.1158          0.2            1.038          7.2
5.0         0.07387        0.1617         0.2355          0.4            2.066          14.4
12.5        0.2210         0.3751         0.5961          0.8            5.106          28.8
25.0        0.4939         0.6890         1.183           1.4            9.978          50.5
50.0        1.071          1.234          2.305           2.1            19.16          75.6
100.0       2.184          2.207          4.397           3.6            36.26          130.0
150.0       3.280          3.106          6.386           4.9            52.58          177.0

TABLE IX: MIE EXTINCTION PARAMETERS AT 1.62 cm WAVELENGTH (18.5 GHz), H2O INDEX OF REFRACTION
7.001 - 2.544i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.0001932      0.002970       0.003162        0.0            0.05064        0.0
1.25        0.002003       0.01814        0.02015         0.1            0.2982         2.2
2.5         0.005166       0.03855        0.04372         0.3            0.6316         6.6
5.0         0.01373        0.08067        0.09440         0.5            1.336          11.1
12.5        0.04672        0.2093         0.2560          1.0            3.544          22.2
25.0        0.1198         0.4172         0.5370          1.7            7.320          37.8
50.0        0.3051         0.8111         1.116           2.9            15.01          64.5
100.0       0.7365         1.525          2.262           5.2            30.19          115.5
150.0       1.209          2.210          3.420           7.4            45.49          164.4

TABLE X: MIE EXTINCTION PARAMETERS AT 1.88 cm WAVELENGTH (16 GHz), H2O INDEX OF REFRACTION
7.5 - 2.5i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.0001035      0.002018       0.002121        0.0            0.039384       0.0
1.25        0.001083       0.01299        0.01407         0.1            0.2410         1.91
2.5         0.002821       0.02836        0.03118         0.3            0.5216         5.74
5.0         0.007659       0.06135        0.06901         0.5            1.181          9.56
12.5        0.02667        0.1661         0.1928          1.0            3.094          19.1
25.0        0.07037        0.3422         0.4126          1.8            6.524          34.4
50.0        0.1853         0.6849         0.8702          3.1            13.58          59.2
100.0       0.4643         1.308          1.773           5.5            27.44          105.5
150.0       0.7701         1.907          2.678           7.9            41.31          151.0

TABLE XI: MIE EXTINCTION PARAMETERS AT 2.73 cm WAVELENGTH (11 GHz), H2O INDEX OF REFRACTION
8.07 - 1.99i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-7)      (deg/km)
0.25        0.00002176     0.0006630      0.0006630       0.0            0.01786        0.0
1.25        0.0002257      0.004325       0.004550        0.1            0.1123         1.3
2.5         0.0005954      0.009876       0.01047         0.3            0.2525         4.0
5.0         0.001664       0.02377        0.02543         0.5            0.6008         6.6
12.5        0.005980       0.07349        0.07947         1.1            1.844          14.5
25.0        0.01652        0.1725         0.1890          1.9            4.330          25.0
50.0        0.04603        0.3936         0.4396          3.4            9.956          44.8
100.0       0.1227         0.8482         0.9710          6.3            21.80          83.0
150.0       0.2057         1.303          1.508           8.9            33.72          118.7

TABLE XII: MIE EXTINCTION PARAMETERS AT 5.0 cm WAVELENGTH (6 GHz), H2O INDEX OF REFRACTION
8.670 - 1.202i, FOR LAWS AND PARSONS RAIN

                                                          Medium Index of Refraction m̃
Rain Rate   Scattering     Absorption     Extinction      Re(m̃ - 1)     Im(1 - m̃)     Phase Change
(mm/hr)     Coef. (km^-1)  Coef. (km^-1)  Coef. (km^-1)   (x 10^-6)      (x 10^-8)      (deg/km)
0.25        0.000001855    0.0001138      0.0001156       0.0            0.05728        0.0
1.25        0.00001769     0.0005516      0.0005692       0.1            0.2604         0.7
2.5         0.00004546     0.001138       0.001183        0.3            0.5262         2.2
5.0         0.0001254      0.002589       0.002714        0.5            1.177          3.6
12.5        0.0004493      0.007932       0.008380        1.0            3.554          7.2
25.0        0.001290       0.02076        0.02205         1.9            9.200          13.7
50.0        0.003760       0.05599        0.05975         3.5            24.63          25.2
100.0       0.01065        0.1509         0.1615          6.8            66.19          49.0
150.0       0.01777        0.2542         0.2720          10.1           111.4          80.0
TABLE XIII—MIE EXTINCTION PARAMETERS AT 7.5 cm WAVELENGTH (4 GHz), H2O INDEX OF REFRACTION
8.77 − 0.915i, FOR LAWS AND PARSONS RAIN

                                                                 Medium Index of Refraction m
Rain Rate  Scattering Coef.  Absorption Coef.  Extinction Coef.  Re(m − 1)    Im(1 − m)    Phase Change
(mm/hr)    (km^-1)           (km^-1)           (km^-1)           (× 10^-6)    (× 10^-8)    (deg/km)
0.25       0.0000003639      0.00004853        0.00004889        0.0          0.03631        0.0
1.25       0.000003433       0.0002078         0.0002112         0.1          0.1456         0.5
2.5        0.000008733       0.0003957         0.0004044         0.3          0.2712         1.4
5.0        0.00002337        0.0007859         0.0008092         0.4          0.5307         1.9
12.5       0.00008091        0.002002          0.002083          1.0          1.335          4.8
25.0       0.0002185         0.004256          0.004474          1.8          2.823          8.6
50.0       0.0006030         0.009508          0.01011           3.3          6.285         15.8
100.0      0.001629          0.02251           0.02414           6.4          14.90         30.7
150.0      0.002705          0.03549           0.03820           9.5          23.54         45.6
TABLE XIV—MIE EXTINCTION PARAMETERS AT 10.0 cm WAVELENGTH
(3 GHz), H2O INDEX OF REFRACTION 8.871 − 0.6280i, FOR LAWS AND PARSONS RAIN

                                                                 Medium Index of Refraction m
Rain Rate  Scattering Coef.  Absorption Coef.  Extinction Coef.  Re(m − 1)    Im(1 − m)    Phase Change
(mm/hr)    (km^-1)           (km^-1)           (km^-1)           (× 10^-6)    (× 10^-8)    (deg/km)
0.25       0.0000001149      0.00002309        0.00002320        0.0          0.02296        0.0
1.25       0.000001081       0.00009474        0.00009582        0.1          0.08815        0.36
2.5        0.000002746       0.0001758         0.0001786         0.2          0.1599         0.72
5.0        0.000007326       0.0003369         0.0003443         0.4          0.3017         1.44
12.5       0.00002528        0.0008147         0.0008400         1.0          0.7200         3.6
25.0       0.00006780        0.001627          0.001696          1.8          1.480          6.5
50.0       0.0001857         0.003336          0.003521          3.3          2.924         11.9
100.0      0.0004946         0.007142          0.007637          6.2          6.302         22.3
150.0      0.0008220         0.01099           0.01181           9.2          9.729         33.2
TABLE XV—MIE EXTINCTION PARAMETERS AT 15.0 cm WAVELENGTH (2 GHz), H2O INDEX OF REFRACTION
8.916 − 0.4220i, FOR LAWS AND PARSONS RAIN

                                                                    Medium Index of Refraction m
Rain Rate  Scattering Coef.   Absorption Coef.   Extinction Coef.   Re(m − 1)    Im(1 − m)    Phase Change
(mm/hr)    (× 10^-5 km^-1)    (× 10^-5 km^-1)    (× 10^-5 km^-1)    (× 10^-6)    (× 10^-8)    (deg/km)
0.25       0.002267           0.9807             0.9830             0.0          0.01459        0.00
1.25       0.02129            3.894              3.916              0.1          0.05408        0.24
2.5        0.05403            7.087              7.141              0.2          0.09604        0.48
5.0        0.1489             13.22              13.37              0.4          0.1760         0.96
12.5       0.4957             30.73              31.23              1.0          0.4022         2.40
25.0       1.328              58.70              60.03              1.8          0.7612         4.32
50.0       3.622              113.2              116.8              3.2          1.456          7.68
100.0      9.608              227.4              237.0              6.1          2.941         14.64
150.0      15.96              341.7              357.6              9.0          4.429         21.60
TABLE XVI—MIE EXTINCTION PARAMETERS AT 21.0 cm WAVELENGTH (1.43 GHz), H2O INDEX OF REFRACTION
9.00 − 0.275i, FOR LAWS AND PARSONS RAIN

                                                                    Medium Index of Refraction m
Rain Rate  Scattering Coef.   Absorption Coef.   Extinction Coef.   Re(m − 1)    Im(1 − m)    Phase Change
(mm/hr)    (× 10^-5 km^-1)    (× 10^-5 km^-1)    (× 10^-5 km^-1)    (× 10^-6)    (× 10^-8)    (deg/km)
0.25       0.0005909          0.4531             0.4537             0.0          0.009192       0.0
1.25       0.005548           1.746              1.752              0.1          0.03347        0.17
2.5        0.01406            3.137              3.151              0.2          0.05885        0.34
5.0        0.03748            5.774              5.811              0.4          0.1065         0.68
12.5       0.1288             13.17              13.30              1.0          0.2390         1.71
25.0       0.3449             24.66              25.01              1.8          0.4431         3.09
50.0       0.9399             46.29              47.23              3.2          0.8237         5.50
100.0      2.491              90.75              93.24              6.1          1.619         10.5
150.0      4.138              134.8              139.0              8.9          2.408         15.3
TABLE XVII—MIE EXTINCTION PARAMETERS AT 0.6328 μ WAVELENGTH,
H2O INDEX OF REFRACTION 1.33 − 0.0i, FOR LAWS AND PARSONS RAIN

Rain Rate  Scattering Coef.  Absorption Coef.  Extinction Coef.
(mm/hr)    (km^-1)           (km^-1)           (km^-1)
0.25       0.08093           0.00              0.08093
1.25       0.2482            0.00              0.2482
2.5        0.3977            0.00              0.3977
5.0        0.6519            0.00              0.6519
12.5       1.273             0.00              1.273
25.0       2.069             0.00              2.069
50.0       3.221             0.00              3.221
100.0      5.689             0.00              5.689
150.0      8.046             0.00              8.046





decreased from 21 to 0.1 cm. This phenomenon is also illustrated in 
Fig. 26 of a paper previously presented in this journal by Chu and 
Hogg (1968).° It serves to warn the reader that he should be very 
careful when applying the common rules of thumb relating wavelength 
and attenuation. 
A Linear Phase Modulator for 
Large Baseband Bandwidths 


By C. L. RUTHROFF and W. F. BODTMANN 
(Manuscript received June 3, 1970) 


A linear phase modulator with a stable carrier frequency would be a 
useful component in radio systems—especially in coherent phase-shift- 
keyed PCM systems with baud rates of the order of 100 megabauds per 
second. 

The Armstrong modulator appears adequate for these applications; the 
circuit functions required for its realization are well understood and 
amenable to the techniques of integrated circuitry. 

In this paper, an analysis of the signal and distortion properties of 
the Armstrong circuit and variations of it is presented and applied to 
three system applications: as a replacement modulator for existing low- 
index analog systems; for multilevel coherent phase-shift-keyed PCM 
systems; and for frequency-division frequency-modulation multiplex 
systems which are of interest in radio trunk systems. 


I. INTRODUCTION 


A linear phase modulator with a stable carrier frequency would be 
a useful component for the following three applications. 


(i) As a replacement modulator for the reflex klystron in an 
otherwise all solid-state repeater of the TL System.’ 
(ii) For frequency-division frequency-modulation multiplex systems 
with baseband bandwidths of the order of 100 MHz.’ 
(iii) For multi-level coherent phase-shift-keyed PCM systems with 
baud rates of the order of 100 megabauds per second.° 


The modulator described in this paper appears adequate for these 
applications. It is based upon the original Armstrong circuit which is 
well suited to large baseband bandwidths and is reasonably linear for 
low modulation indexes.* An important feature of this method of 
modulation is that the carrier frequency can be stable with respect 
to ambient effects since it can be derived from a temperature-stabilized 
quartz crystal oscillator. The baseband bandwidths which may be 
achieved are those for which low-index double sideband amplitude 
modulators can be built. 

An analysis of distortion is presented for the types of baseband 
signals used in the three applications discussed above, and a circuit is 
described in which the phase deviation can be increased to any desired 
value. 


II. CIRCUIT DESCRIPTION 


The Armstrong modulator is illustrated in Fig. 1. The baseband 
signal is modulated in a double-sideband suppressed-carrier amplitude 
modulator with a sufficiently low index of modulation to ensure suitable 
linearity. At the modulator output another carrier, 90° out of phase 
with the first, is added to the sidebands. The residual AM is removed 
by the limiter whose output is a low-index phase-modulated signal. 
The phase distortion can be made arbitrarily small by choice of the 
carrier to sideband power ratio at the limiter input; the result is a 
nearly linear, low-index phase-modulated signal. 

Let the baseband signal be 

$$e = v(t), \quad \text{with } |v(t)| \le 1. \tag{1}$$

The output of the double-sideband suppressed-carrier amplitude 
modulator is 

$$e_1 = m\,v(t)\cos\omega_0 t, \tag{2}$$

where $m \ll 1$ is the index of modulation. 


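A quadrature carrier is added to $e_1$ in approximately the correct 
phase to obtain 

$$e_p = \sin(\omega_0 t + \epsilon) + m\,v(t)\cos\omega_0 t \tag{3}$$

$$e_p = \sqrt{1 + 2 m v(t)\sin\epsilon + m^2 v^2(t)}\;\sin\left[\omega_0 t + \epsilon + \tan^{-1}\frac{m v(t)\cos\epsilon}{1 + m v(t)\sin\epsilon}\right] \tag{4}$$

where $\epsilon$ is small and represents any error in carrier phase. 

[Fig. 1—Armstrong phase modulator. Block diagram: the baseband signal $v(t)$ drives a DSB-SC amplitude modulator fed from a stable carrier source; the phase-shifted carrier $\sin(\omega_0 t + \epsilon)$ is added to the modulator output; a limiter follows, giving the phase-modulated output signal $\sin[\omega_0 t + \epsilon + \varphi(t)]$.]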

If this signal is passed through a perfect limiter the envelope becomes 
constant, leaving an angle-modulated signal whose phase modulation is 

$$\varphi(t) = \tan^{-1}\frac{m v(t)\cos\epsilon}{1 + m v(t)\sin\epsilon}. \tag{5}$$


When the nonlinear distortion is small, the controlling distortions 
will be second and third order, so terms in the expansion of equation (5) 
beyond the third will be omitted and (5) becomes 

$$\varphi(t) \approx m v(t)\cos\epsilon - m^2 v^2(t)\sin\epsilon\cos\epsilon + m^3 v^3(t)\sin^2\epsilon\cos\epsilon - \frac{m^3}{3}\,v^3(t)\cos^3\epsilon. \tag{6}$$


Ideally, $\epsilon = 0$ and the first term in equation (6) is the desired 
modulating signal; the second and third terms will be zero and the last 
term is the third-order distortion. When $\epsilon \ne 0$, second-order 
distortion occurs and the desired output signal amplitude is reduced by 
the factor $\cos\epsilon$. 
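
As a rough numerical illustration of how closely the third-order expansion (6) tracks the exact phase (5), the following Python sketch evaluates both. The values of `m` and `eps` are arbitrary choices made for this example and do not come from the paper.

```python
import math

def phase_exact(m, v, eps):
    """Equation (5): phase modulation at the limiter output."""
    return math.atan2(m * v * math.cos(eps), 1.0 + m * v * math.sin(eps))

def phase_series(m, v, eps):
    """Equation (6): expansion of (5) through third order in m*v."""
    x = m * v
    c, s = math.cos(eps), math.sin(eps)
    return x * c - x**2 * s * c + x**3 * s**2 * c - (x**3 / 3.0) * c**3

# Illustrative (assumed) values: low index, 2 degree carrier phase error.
m, eps = 0.1, math.radians(2.0)
for v in (-1.0, -0.5, 0.5, 1.0):
    exact = phase_exact(m, v, eps)
    approx = phase_series(m, v, eps)
    print(f"v={v:+.1f}  exact={exact:.8f}  series={approx:.8f}  diff={exact - approx:.1e}")
```

For these values the two expressions differ only at fourth order in $m v$, which is the sense in which terms beyond the third are omitted.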

It can be seen from equation (6) that the distortion can be made 
as small as desired by the proper choice of m, which is proportional to 
the phase deviation. In order to determine suitable values of m, v(t) 
must be specified; we shall consider three signals of interest, corre- 
sponding to the three applications listed in Section I. 


2.1 Case I 


The signal $v(t)$ is gaussian noise uniformly distributed in a 
bandwidth extending from 0 to $W$ Hz. 

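For nonlinearities of the type described in equation (6) the desired 
results can be computed by well-known methods.’ 

$$\frac{S_0(f)}{S_2(f)} = \frac{1}{2 m^2\sigma^2\sin^2\epsilon\left[1 - \dfrac{|f|}{2W}\right]}, \qquad 0 < |f| \le W, \tag{7}$$

$$\frac{S_0(f)}{S_3(f)} = \frac{2}{m^4\sigma^4\cos^4\epsilon\left[1 - \dfrac{1}{3}\left(\dfrac{f}{W}\right)^2\right]}, \qquad 0 < |f| \le W, \tag{8}$$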
where 

$S_0(f) = m^2\cos^2\epsilon\,(\sigma^2/2W)$, with $-W \le f \le W$, is the 
spectral density of the phase of the output signal, 

$S_2(f)$, $S_3(f)$ are the spectral densities of the second- and 
third-order distortion terms, respectively, 

$\sigma^2$ is the mean square value of $v(t)$, that is, the power in 
$v(t)$, and 

$m\sigma$ is the rms phase deviation. 


2.2 Case II 


$$v(t) = \sum_{n=1}^{N} Q\cos(np + q_n)t. \tag{9}$$

The baseband signal, $v(t)$, is a frequency-division frequency-modulated 
multiplex signal. Each term in equation (9) is an FM carrier with 
its own frequency modulation $q_n$. Bennett has derived the number 
and types of modulation products produced by the last three terms of 
equation (6) for $v(t)$ as in equation (9).° The second-order term of 
largest amplitude has the form 

$$e_2 = m^2 Q^2\sin\epsilon\cos\epsilon\,\cos[(m + n)p + (q_m + q_n)]t. \tag{10}$$

Similarly, the controlling third-order product has the form 

$$e_3 = \frac{m^3}{2}\,Q^3\cos^3\epsilon\,\cos[(l + m + n)p + (q_l + q_m + q_n)]t. \tag{11}$$

The total power in the signal of equation (9) is 

$$\sigma^2 = \frac{N Q^2}{2}, \tag{12}$$

where $N$ is the number of terms in equation (9). From equation (6) 
the output phase modulation for an individual channel is 

$$e_1 = m Q\cos\epsilon\,\cos(np + q_n)t. \tag{13}$$

The ratios of signal-to-distortion power for single modulation 
products are 

$$\frac{|e_1|^2}{|e_2|^2} = \left[\frac{1}{m Q\sin\epsilon}\right]^2, \tag{14}$$

$$\frac{|e_1|^2}{|e_3|^2} = \left[\frac{2}{m^2 Q^2\cos^2\epsilon}\right]^2. \tag{15}$$


In order to determine the total signal-to-distortion power ratios it is 
necessary to compute the number of products falling in the $k$th channel, 
$1 \le k \le N$. Assuming power addition for these products, the total 
signal-to-distortion ratios become 

$$\frac{S}{D_2} = \frac{1}{2 m^2\sigma^2\sin^2\epsilon}\left(\frac{N}{N_2}\right), \tag{16}$$

$$\frac{S}{D_3} = \frac{2}{m^4\sigma^4\cos^4\epsilon}\left(\frac{N^2}{2N_3}\right), \tag{17}$$





where 

$N$ is the total number of channels, i.e., the number of terms in 
equation (9), 

$N_2$ is the equivalent number of $m + n$ type products and includes 
other second-order products weighted in accordance with their 
contribution to the distortion power. It is a function of $k$ and $N$, 
and 

$N_3$ is the equivalent number of $l + m + n$ type products and includes 
other third-order products weighted in accordance with 
their contribution to the distortion power. It is a function of $k$ 
and $N$. 


Expressions (16) and (17) for the signal consisting of $N$ sine waves 
are much like expressions (7) and (8) for the case of the noise-like 
signal. It has been shown by Bennett that the sum of randomly phased 
sine waves of equation (9) behaves like noise as $N$ increases without 
bound, provided the power and bandwidth are finite.’ It is of interest to 
see in the present context how fast expressions (16) and (17) approach 
(7) and (8) as $N$ increases; this is shown in Figs. 2 and 3. It is evident 
from the figures that the signal-to-distortion ratios are not a strong 
function of the number of channels, the ratios changing by a maximum of 
2 dB while the number of channels goes from 10 to infinity. 
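
The step from the single-product ratios to the total ratios can be checked numerically. The sketch below assumes the forms $S/D_2 = (N/N_2)/(2m^2\sigma^2\sin^2\epsilon)$ and $S/D_3 = (N^2/2N_3)\cdot 2/(m^4\sigma^4\cos^4\epsilon)$ for expressions (16) and (17), and uses $\sigma^2 = NQ^2/2$ from equation (12). The numerical values, including the product counts `n2` and `n3`, are hypothetical placeholders and are not taken from Figs. 4 and 5.

```python
import math

def sd_second_order(m, sigma, eps, n_channels, n2):
    """Expression (16), as read here: (N / N2) / (2 m^2 sigma^2 sin^2 eps)."""
    return (n_channels / n2) / (2.0 * m**2 * sigma**2 * math.sin(eps) ** 2)

def sd_third_order(m, sigma, eps, n_channels, n3):
    """Expression (17), as read here: (N^2 / 2 N3) * 2 / (m^4 sigma^4 cos^4 eps)."""
    return (n_channels**2 / (2.0 * n3)) * 2.0 / (m**4 * sigma**4 * math.cos(eps) ** 4)

def to_db(ratio):
    return 10.0 * math.log10(ratio)

# Hypothetical example values (not from the paper).
m, sigma, eps = 0.5, 0.5, math.radians(1.0)
n_channels, n2, n3 = 500, 400, 90000

# Cross-check: single-product ratios (14) and (15) with power addition
# over n2 (or n3) products, using sigma^2 = N Q^2 / 2 from equation (12).
q_amp = math.sqrt(2.0 * sigma**2 / n_channels)
single2 = (1.0 / (m * q_amp * math.sin(eps))) ** 2
single3 = (2.0 / (m**2 * q_amp**2 * math.cos(eps) ** 2)) ** 2

print(f"S/D2 = {to_db(sd_second_order(m, sigma, eps, n_channels, n2)):.1f} dB")
print(f"S/D3 = {to_db(sd_third_order(m, sigma, eps, n_channels, n3)):.1f} dB")
```

Dividing each single-product ratio by the equivalent number of products reproduces the totals, which is the power-addition assumption stated in the text.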

For a more detailed look at the behavior of the distortion products, 
the numbers of the various types of products falling in the $k$th channel 
for the 500-channel case are shown in Figs. 4 and 5. 


2.3 Case III 


In this case the baseband signal is a sequence of pulses which phase 
modulate a carrier in the format of a phase-shift-keyed system. A 
4-level polar baseband signal is written 

$$v(t) = V_0\sum_{n=-\infty}^{\infty} k_n\,p(t - nT), \tag{18}$$

where 

$p(t)$ is the pulse shape, 
$T$ is the time interval between adjacent pulses, and 
$k_n = \pm 1, \pm 3$. 

In a 4-level PSK system, a maximum peak deviation of $\pm 3\pi/4$ radians 
is required. Deviations of this magnitude may be obtained by multiplying 
the output of the modulator in a resistive multiplier circuit. 

[Fig. 2—The effect of the number of channels on the ratio of signal-to-second-order distortion. Ordinate: $N/N_2$ in decibels.]

[Fig. 3—The effect of the number of channels on the ratio of signal-to-third-order distortion. Curves for $N = 10$, 25, 100, and the limiting curve $1/[1 - (1/3)(f/W)^2]$.]

[Fig. 4—Number of second-order distortion products.]

[Fig. 5—Number of third-order distortion products.]

[Fig. 6—Modulator input and output pulses.]

As an example, suppose the modulator output is multiplied by four. 
The peak deviation required in the modulator is then $3\pi/16$ radians. 
Raised cosine input pulses, $v(t)$, and the corresponding phase deviations 
in the output of the modulator are shown in Fig. 6 for this case. 
The output pulses were computed from equation (5) for $\epsilon = 0$. The 
value of $m$ was chosen to result in a peak deviation of $3\pi/16$ radians 
for the pulse corresponding to $k_n = 3$. For this example, 
$m = \tan(3\pi/16)$, and 





v(t) = ° E + cos at : . < 2, 

In Fig. 6, the phase deviation, $\varphi$, is shown for pulses having 
$k_n = \pm 1, \pm 3$. Some pulse compression is present in the larger pulse, 
and the parameter $m$ has been chosen for the correct peak deviation. For 
the smaller pulse the peak deviation is seen to be too large by about five 
degrees. If uncorrected, this error would cause the system performance 
to be degraded a few tenths of a dB.° The peak deviation can be corrected 
by a gain adjustment in the circuits in which the smaller pulses 
are generated.” 
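
The size of this error can be estimated with a short Python sketch. It assumes, as an illustration only, that the pulse for $k_n$ peaks at $v = k_n/3$ (so that the largest pulse peaks at $v = 1$), that $\epsilon = 0$ in equation (5), and that the times-four multiplier scales the phase deviation by exactly four.

```python
import math

MULTIPLIER = 4                      # modulator output multiplied by four
PEAK_OUT = 3.0 * math.pi / 4.0      # required peak deviation at the output
m = math.tan(PEAK_OUT / MULTIPLIER) # m = tan(3*pi/16), as in the example

def modulator_peak(k):
    """Peak phase from equation (5) with eps = 0; pulse peak assumed to be k/3."""
    return math.atan(m * k / 3.0)

def output_error_deg(k):
    """Peak deviation error after multiplication, in degrees."""
    ideal = k * math.pi / 16.0      # ideal modulator deviation for level k
    return math.degrees((modulator_peak(k) - ideal) * MULTIPLIER)

for k in (1, 3):
    print(f"k={k}: modulator peak = {math.degrees(modulator_peak(k)):.3f} deg, "
          f"output error = {output_error_deg(k):+.2f} deg")
```

Under these assumptions the $k_n = 3$ pulse has no peak error by construction, while the $k_n = 1$ pulse comes out roughly five degrees too large at the multiplied output, in line with the figure quoted in the text.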


III. MODIFIED ARMSTRONG MODULATORS 


There may be applications in which it is desirable that the output 
carrier frequency equal the frequency of the source carrier. The circuit 
of Fig. 7 will accomplish this purpose while minimizing the degradation 
due to tones generated in the final mixer. The carrier frequencies of 
any high-order products of the two input signals which fall into the 
output band will be exactly at the carrier frequency of the output 
signal and result in minimum degradation. 

If the times $(N - 1)$ frequency multiplier is replaced by a times 
$M$ multiplier, the flexibility in the choice of output carrier frequency 
is increased while the feature described above is retained. In either case 
the frequency multipliers should be resistive rather than reactive. 

Finally, in the balanced modulator illustrated in Fig. 8 the phase 
deviation is doubled for a specified ratio of signal-to-third-order 
distortion. 


IV. CONCLUDING REMARKS 
The Armstrong modulator has three attractive features. 


(i) The carrier frequency can be derived from a frequency-stabilized 
oscillator. For example, a single source can be used in both 
modulators used to derive two cross-polarized channels for a 
short hop radio system or a satellite radio system. The identical 
carrier frequencies serve to minimize the effect of co-channel 
interference due to cross-polarization coupling. 

(ii) The functions required to realize the modulator (limiting, 
mixing, and multiplication) are amenable to circuit integration. 

(iii) The modulator is suitable for very large baseband bandwidths, 
particularly high-speed pulse sequences for PSK-PCM systems. 


A short hop radio system has been described recently which has about 
the same communication capacity for either large index analog phase 
modulation or digital PSK-PCM.® In a system designed for either type 
of operation, it is convenient to do the digital processing at the 
intermediate frequency; if PSK-PCM is to be used, the IF amplifier can be 
replaced by a digital regenerative repeater and no other changes need 
be made (see Fig. 9). 

A digital regenerative repeater has been described which is appropriate 
for this application; it requires a phase modulator whose requirements 
are met by the configuration of Fig. 7: that incidental 
AM be small, that the frequency be stable, that the linearity be adequate 
for multi-level operation, and that the power consumption be small.’ 

[Fig. 7—A modulator with output frequency equal to frequency of stable carrier source.]

[Fig. 8—Balanced Armstrong modulator.]

[Fig. 9—Repeater configuration for analog phase modulation or digital CPSK-PCM modulation.]
Eventual Stability for Lipschitz 
Functional Differential Systems 


By GERALD A. SHANHOLT 
(Manuscript received April 3, 1970) 


In this paper it is established that for Lipschitz functional differential 
systems, the eventual uniform asymptotic stability of the origin is preserved 
under absolutely diminishing perturbations. 


I. INTRODUCTION AND NOTATION 


In two recent papers, A. Strauss and J. A. Yorke have investigated 
“eventual” stability properties for systems of ordinary differential 
equations.’'” In particular, they have shown that for Lipschitz systems, 
diminishing perturbations preserve eventual uniform asymptotic sta- 
bility. It is the purpose of this paper to extend a somewhat weaker 
form of this result to functional differential systems. Namely, it will be 
shown that for Lipschitz functional differential systems, the eventual 
uniform asymptotic stability of the origin is preserved under absolutely 
diminishing perturbations. 

The following notation will be used in this paper: $E^n$ is the space of 
$n$-vectors, and for $x$ in $E^n$, $|x|$ denotes any vector norm. For a given 
number $\tau > 0$, $C$ denotes the linear space of continuous functions 
mapping the interval $[-\tau, 0]$ into $E^n$, and for $\phi$ in $C$, 
$\|\phi\| = \sup |\phi(\theta)|$, $-\tau \le \theta \le 0$. For $H > 0$, $C_H$ 
denotes the set of $\phi$ in $C$ for which $\|\phi\| < H$. 
For any continuous function $x(u)$ whose domain is $-\tau \le u \le a$, $a \ge 0$, 
and whose range is in $E^n$, and any fixed $t$, $0 \le t \le a$, the symbol $x_t$ 
will denote the function $x_t(\theta) = x(t + \theta)$, $-\tau \le \theta \le 0$; 
that is, $x_t$ is in $C$, and is that segment of the function $x(u)$ defined by 
letting $u$ range in the interval $t - \tau \le u \le t$. 

Let $F(t, \phi)$ be a function defined on $D_H = [0, \infty) \times C_H$ into 
$E^n$, and let $\dot{x}(t)$ denote the right hand derivative of $x(u)$ at 
$u = t$. Consider the functional differential system 

$$\dot{x}(t) = F(t, x_t). \tag{1}$$
Let $(s, \phi)$ be in $D_H$. A function $x(s, \phi)(t)$ is said to be a solution of 
equation (1) with initial function $\phi$ at $t = s$ if there exists a number 
$b > 0$ such that 

(i) for $t \in [s, s + b)$, $x_t(s, \phi)$ is defined and in $C_H$; 
(ii) $x_s(s, \phi) = \phi$; and 
(iii) $x(s, \phi)(t)$ satisfies equation (1) for $s \le t < s + b$. 

$x(s, \phi)(t)$ is unique if every other solution with the same initial function 
$\phi$ at $t = s$ agrees with $x(s, \phi)(t)$ in their common domain of definition. 

If $F$ is continuous on $D_H$, then for every $(s, \phi)$ in $D_H$ there is at least 
one solution of equation (1) with initial function $\phi$ at $t = s$.° If, 
moreover, $F$ is Lipschitzian in $\phi$, that is, there is a constant $L$ such 
that for every $\phi_1$, $\phi_2$ in $C_H$ 

$$|F(t, \phi_1) - F(t, \phi_2)| \le L\,\|\phi_1 - \phi_2\| \tag{2}$$

for $t \ge 0$, then there is only one such solution. Generally, under such 
assumptions, one can only expect solutions to exist over a finite interval. 


II. PRELIMINARIES 


We now define the stability concepts to be used herein. These definitions 
are stated for equation (1), in which it is assumed that for some 
$H$, $0 < H \le \infty$, $F$ is continuous and Lipschitzian on $D_H$. 


Definition 1: The origin is eventually uniformly stable (EvUS) if for 
every $\epsilon > 0$, there exists a $\delta = \delta(\epsilon) > 0$ and 
$\alpha = \alpha(\epsilon) \ge 0$ such that $\|x_t(s, \phi)\| < \epsilon$ for all 
$\|\phi\| < \delta$ and $t \ge s \ge \alpha$. It is uniformly stable 
(US) if one can choose $\alpha(\epsilon) \equiv 0$. 


Definition 2: The origin is eventually uniformly attracting (EvUA) if 
there exist constants $\eta > 0$ and $\beta \ge 0$, and if for every 
$\epsilon > 0$ there exists a $T = T(\epsilon) > 0$ such that 
$\|x_t(s, \phi)\| < \epsilon$ for $\|\phi\| < \eta$, $s \ge \beta$, 
and $t \ge s + T$. It is uniformly attracting (UA) if one can choose $\beta = 0$. 


Definition 3: The origin is eventually uniform-asymptotically stable 
(EvUAS) if it is both EvUS and EvUA. It is uniform-asymptotically 
stable (UAS) if it is both US and UA. 


The above definitions show that EvUS, EvUA, and EvUAS are 
weaker stability concepts than their respective Lyapunov counterparts: 
US, UA, and UAS. Also, it should be noted that in these definitions we 
do not require that the zero function be a solution of equation (1). 
When the origin is US, this implies that the zero function is a unique 
solution of equation (1) for any $s \ge 0$. Thus, we see that EvUS 
(EvUAS) is a natural generalization of US (UAS) in which it is not 
assumed that the zero function is a solution. Finally, it is important 
to note that UA does not imply that the zero function is a solution 
(Ref. 1, example 2.8). 


Definition 4: Let $V(t, \phi)$ be a function defined for $(t, \phi)$ in $D_H$. The 
derivative of $V$ along solutions of equation (1) will be denoted by 
$\dot{V}_{(1)}[t, x_t(s, \phi)]$ and is defined to be 

$$\dot{V}_{(1)}[t, x_t(s, \phi)] = \limsup_{h \to 0+} \frac{1}{h}\left\{V[t + h, x_{t+h}(s, \phi)] - V[t, x_t(s, \phi)]\right\}.$$


If F is continuous and Lipschitzian, and if the origin is EvUAS, 
then the existence of a Lyapunov type comparison function can be 
established. By following D. Wexler* and A. Halanay”® one can prove 
the following theorem. 


Theorem 1: Let $F$ be continuous and Lipschitzian on $D_H$, and let the 
origin be EvUAS. Then there exists a number $K$, $0 < K < H$, and a 
function $V(t, \phi)$ with the properties: (i) there exist functions $a(r)$, $b(r)$ 
continuous, positive, and monotone increasing for $r > 0$, with $a(0) = 
b(0) = 0$, such that for $m$ in $(0, K]$ 

$$a(\|\phi\|) \le V(t, \phi) \le b(\|\phi\|)$$

for $m \le \|\phi\| \le K$, $t \ge d(m)$, where $d(r)$ is a continuous, nonnegative, 
and nonincreasing function for $r > 0$; (ii) there exists a function $c(r)$ 
continuous, positive, and monotone-increasing for $r > 0$, with $c(0) = 0$, 
such that 

$$\dot{V}_{(1)}[t, x_t(s, \phi)] \le -c[\|x_t(s, \phi)\|]$$

for $\|\phi\| \le K$, $t \ge s \ge d(K)$; and (iii) for $0 < r \le \|\phi_i\| \le K$, 
$t \ge d(K)$, 

$$|V(t, \phi_1) - V(t, \phi_2)| \le M(r)\,\|\phi_1 - \phi_2\|,$$

where $M(r)$ is continuous and monotone-decreasing on $(0, K]$. 


III. PERTURBED EQUATION 


We now prove a theorem which shows that the EvUAS of the origin 
of the nominal equation 

$$\dot{y}(t) = F(t, y_t) \tag{N}$$

is preserved for the perturbed equation 

$$\dot{x}(t) = F(t, x_t) + G(t, x_t) \tag{P}$$

when $F$ and $G$ satisfy certain conditions. In particular, $G(t, \phi)$ is required 
to be absolutely diminishing, that is, for every $m$ in $(0, H)$, there exists 
$\gamma_m \ge 0$ and a function $g_m(t)$ continuous on $[\gamma_m, \infty)$ such that for 
$m \le \|\phi\| < H$, $t \ge \gamma_m$, 

$$|G(t, \phi)| \le g_m(t) \quad \text{and} \quad I_m(t) \triangleq \int_t^{t+1} g_m(s)\,ds \to 0 \ \text{as } t \to \infty.$$
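
To make the definition concrete, the following Python sketch takes $I_m(t)$ to be the integral of $g_m$ over $[t, t+1]$ and uses the hypothetical bound $g(t) = 1/(1 + t)$, which is chosen only for illustration and does not appear in the paper. This $g$ is not integrable on $[0, \infty)$, yet its unit-window integral tends to zero, so it would qualify as a bound for an absolutely diminishing perturbation.

```python
import math

def g(t):
    """Hypothetical bound on |G|; not integrable on [0, infinity)."""
    return 1.0 / (1.0 + t)

def window_integral(t):
    """I(t): closed form of the integral of g over [t, t + 1]."""
    return math.log((t + 2.0) / (t + 1.0))

def sup_tail(t):
    """J(t) = sup{ I(s) : s >= t - 1 }; I is decreasing here, so the sup is I(t - 1)."""
    return window_integral(t - 1.0)

def integral_of_g(u, t):
    """Closed form of the integral of g over [u, t]."""
    return math.log((1.0 + t) / (1.0 + u))

for t in (1.0, 10.0, 100.0, 1000.0):
    print(f"t={t:7.1f}  I(t)={window_integral(t):.6f}  J(t)={sup_tail(t):.6f}")
```

For this choice the integral of $g$ over $[u, t]$ stays below $(t - u + 1)\,J(u)$ for $t \ge u \ge 1$, which can be confirmed by evaluating `integral_of_g` and `sup_tail` at a few points.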


Theorem 2: Suppose that $F$ and $G$ are continuous and Lipschitzian 
on $D_H$, that $G$ is absolutely diminishing, and that the origin is EvUAS for 
equation (N). Then the origin is EvUAS also for equation (P). 


Proof: Define $J_m(t) = \sup\,[I_m(s) : t - 1 \le s < \infty]$ for $t \ge 1$. Since 
$I_m(t) \to 0$ as $t \to \infty$, this implies $J_m(t) \to 0$ monotonically as 
$t \to \infty$. Let $0 < \epsilon \le K$, choose $\|\phi\| < \delta(\epsilon) = 
b^{-1}[a(\epsilon)/2]$, and pick $\theta = \theta(\epsilon) \ge 0$ such that 

$$2 L M(\delta) J_\delta(t) < \min\,[a(\epsilon), c(\delta)] \tag{3}$$

for $t \ge \theta$, where $L$ is the Lipschitz constant associated with $F$. Then 
for $t \ge s \ge \alpha(\epsilon) = \max\,[1, \theta(\epsilon), d(\delta)]$, 
$\|x_t(s, \phi)\| < \epsilon$. Suppose not, that is, for some $t \ge s$, 
$\|x_t(s, \phi)\| = \epsilon$. Let $q$ be the first $t$-value greater 
than $s$ for which $\|x_t(s, \phi)\| = \epsilon$, and let $p$ be the last 
$t$-value less than $q$ for which $\|x_t(s, \phi)\| = \delta$. Then 

$$\delta \le \|x_t(s, \phi)\| \le \epsilon, \qquad p \le t \le q. \tag{4}$$
For $t$ in an interval on which $x(s, \phi)(t)$ exists, we evaluate 

$$\dot{V}_{(P)}[t, x_t(s, \phi)] \le \dot{V}_{(N)}[t, x_t(s, \phi)] + \limsup_{h \to 0+} \frac{1}{h}\left\{V[t + h, x_{t+h}(t, x_t(s, \phi))] - V[t + h, y_{t+h}(t, x_t(s, \phi))]\right\}$$

$$\le -c[\|x_t(s, \phi)\|] + \limsup_{h \to 0+} \frac{M}{h}\,\|x_{t+h}(t, x_t(s, \phi)) - y_{t+h}(t, x_t(s, \phi))\|,$$

where the function $V$ is as described in Theorem 1. By assuming, with 
no loss of generality, that $L > 1$, we obtain’ from the above inequality 

$$\dot{V}_{(P)}[t, x_t(s, \phi)] \le -c[\|x_t(s, \phi)\|] + L M\,|G[t, x_t(s, \phi)]|.$$
Employing the absolute diminishing character of $G$ and equation (4), 
we obtain by integrating the above from $p$ to $q$ 

$$a(\epsilon) \le b(\delta) - (q - p)\,c(\delta) + L M \int_p^q g_\delta(t)\,dt. \tag{5}$$

Using the easily shown fact that 

$$\int_u^t g_\delta(s)\,ds \le \int_{u-1}^t I_\delta(s)\,ds, \qquad t \ge u \ge 1,$$

and equations (3) and (5), we see that 

$$a(\epsilon) \le b(\delta) - (q - p)\,c(\delta) + L M (q - p + 1) J_\delta(p) < b(\delta) + a(\epsilon)/2 = a(\epsilon).$$

Hence, we arrive at a contradiction which shows that the origin is 
EvUS. 


Let $\eta = \delta(K)$, $\beta = \alpha(K)$, and 

$$T(\epsilon) = \alpha(\epsilon) + 2[L M J_\delta(1) + b(K)]/c(\delta). \tag{6}$$

Consider $s \ge \beta$ and $\|\phi\| < \eta$. Thus, $x(s, \phi)(t)$ exists for all 
$t \ge s$. Moreover, since the origin is EvUS, to prove EvUA it is sufficient to 
show the existence of a $u$, $s + \alpha \le u < s + T$, such that 
$\|x_u(s, \phi)\| < \delta(\epsilon)$. Assume the contrary, that is, 

$$\delta \le \|x_t(s, \phi)\| \le K, \qquad s + \alpha \le t \le s + T.$$

Employing the same procedure as above, we arrive at the estimate 

$$a(\delta) < b(K) - (T - \alpha)\,c(\delta) + M L (T - \alpha + 1) J_\delta(s + \alpha).$$

Using the monotonicity of $J_\delta$ and equations (3) and (6), we compute 

$$a(\delta) < b(K) - \tfrac{1}{2}\,c(\delta)(T - \alpha) + M L J_\delta(1) = 0.$$

This contradiction then completes the proof of this theorem. 
Information Theory and Approximation of 
Bandlimited Functions 


By DAVID JAGERMAN 
(Manuscript received April 15, 1970) 


For bandlimited functions, simultaneous approximation of a function 
and several of its derivatives is considered. Concomitant entropy estimates 
are obtained. A feasible algorithm for the transmission of information is 
discussed. This algorithm has been applied to the design of a class of PCM 
systems.” 


I. INTRODUCTION 


It is the purpose of this paper to discuss both the best approximation 
of sets of bandlimited functions under Sobolev norms and the con- 
comitant information-theoretic estimates. The Sobolev norms are 
useful when it is desired to approximate simultaneously the function 
and some of its derivatives. This requires an amount of information 
beyond that for approximating only the function. Section II gives the 
necessary background definitions of width, entropy, and capacity; 
theorems providing representations of bandlimited functions, as well as 
a form of Mitjagin’s inequality relating approximability to entropy, 
are proved. The distinction between capacity and entropy is comparable 
to that between communication and storage, since capacity refers to 
the number of distinguishable functions transmitted from a signal 
source while entropy measures a bit requirement for the reproduction 
of a function to within a specified accuracy. A constructive approach 
to communication requirements implies an explicit means of representing 
any function of the signal source by numbers with a uniformly bounded 
number of digits. The procedure or algorithm used is usually obtained 
from an infinite series representation with subsequent truncation and 
quantization. Pulse code modulation systems provide examples of this 
procedure. Section II gives a precise definition, while Section III presents 
an explicit construction of a feasible algorithm. This algorithm has been 
applied to the design of a class of PCM systems.’ 


Sections III and IV contain the theorems and proofs which provide 
upper bounds on widths and entropies. Section III discusses signal 
sources with finite instantaneous power. Section IV considers signal 
sources in which the total energy is finite. 


II. PRELIMINARIES 


Let $A$ be a subset of a Banach space $X$; it is desired to approximate $A$, 
that is, uniformly all elements of $A$, by means of $n$-dimensional 
subspaces $X_n$ of $X$. The deviation $E_{X_n}(A)$ of $A$ from $X_n$ is defined by 

$$E_{X_n}(A) = \sup_{x \in A}\ \inf_{y \in X_n} \|x - y\|. \tag{1}$$

The deviation provides information on how well $A$ may be uniformly 
approximated by elements of the given space $X_n$; however, another 
choice of $X_n$ might provide a smaller deviation. Accordingly the $n$th 
width, $d_n^X(A)$, of $A$ relative to the space $X$ is defined by” 

$$d_n^X(A) = \inf_{X_n \subset X} E_{X_n}(A). \tag{2}$$

If the infimum is attained, then a corresponding $X_n$ is called an extremal 
subspace. The following properties are immediate. 

$$0 \le d_{n+1}^X(A) \le d_n^X(A), \qquad n \ge 0, \tag{3}$$

$$d_0^X(A) = \sup_{x \in A} \|x\|, \tag{4}$$

$$B \subset A \Rightarrow d_n^X(B) \le d_n^X(A). \tag{5}$$

If $X$ has finite dimension $m$, then $d_n^X(A) = 0$ for $n \ge m$. 

A set of sets whose diameters do not exceed $2\epsilon$ ($\epsilon > 0$) and whose 
union contains $A$ is called an $\epsilon$-covering of $A$. A finite set $S \subset X$ such 
that for $f \in A$ there is a $g \in S$ with $\|f - g\| \le \epsilon$ is called an 
$\epsilon$-net of $A$. Clearly $d_n^X(A) \le \epsilon$ for a set $A$ possessing an 
$\epsilon$-net of $n$ elements. If $A$ is totally bounded then 
$\lim_{n \to \infty} d_n^X(A) = 0$. To see this, choose a covering 
of $A$ consisting of $n$ $\epsilon$-balls; then their centers constitute an $\epsilon$-net of $A$. 

Let $N_\epsilon(A)$ (presumed finite) be the number of sets in a minimal 
$\epsilon$-covering of $A$; then the absolute $\epsilon$-entropy, $H_\epsilon(A)$, of $A$ is defined by 

$$H_\epsilon(A) = \log N_\epsilon(A) \tag{6}$$

in which the logarithm is taken to base two.” * 

Let $N_\epsilon^X(A)$ be the number of elements in a minimal $\epsilon$-net $S \subset X$ of $A$; 
then the relative $\epsilon$-entropy, $H_\epsilon^X(A)$, is defined by 

$$H_\epsilon^X(A) = \log N_\epsilon^X(A) \tag{7}$$

in which the logarithm is taken to base two.” * For $A$ totally bounded, 
let $x_1, \ldots, x_n$ be the elements of an $\epsilon$-net, and let $B_j$ ($1 \le j \le n$) be a 
ball of radius $\epsilon$ about $x_j$; then the sets $U_j = B_j \cap A$ constitute an 
$\epsilon$-covering of $A$; hence 

$$H_\epsilon(A) \le H_\epsilon^X(A). \tag{8}$$


The minimum number of binary digits, $d$, of an integer expressed in 
radix two needed to identify uniquely every element in a minimal 
$\epsilon$-covering of $A$ satisfies 

$$[H_\epsilon(A)] \le d \le [H_\epsilon(A)] + 1 \tag{9}$$

in which $[x]$ designates the integral part of $x$, that is, the unique integer 
satisfying $x - 1 < [x] \le x$. Thus $H_\epsilon(A)$ may serve as an absolute 
measure of efficiency for processes designed for the storage and 
transmission of information. 

Let a set $\omega$ of $m$ real numbers be chosen, and also a mapping from $A$
onto $\Omega_p = \omega \times \cdots \times \omega$ ($p$ times); that is,

$$x \in A \to a = (a_1, \dots, a_p) \in \Omega_p, \quad a_1, \dots, a_p \in \omega.$$

Let the algorithm $\Gamma$ define a one-to-one and onto mapping of $\Omega_p$ to an
$\varepsilon$-net $S$ of $A$ in which $\Gamma(a) \in S$ approximates $x \in A$ to within $\varepsilon$; then the
volume $V(\Gamma)$ is defined by

$$V(\Gamma) = p \log m. \tag{10}$$

In view of expression (8), one has

$$V(\Gamma) \ge H_\varepsilon^X(A) \ge H_\varepsilon(A). \tag{11}$$

Thus the greater $V(\Gamma)$ is, the less efficient is the algorithm $\Gamma$ compared
to the absolute standard $H_\varepsilon(A)$.
If $D \subset A$ has the property that

$$f \ne g, \quad f, g \in D \Rightarrow \|f - g\| > \varepsilon, \tag{12}$$

then $D$ is called $\varepsilon$-distinguishable. Let $M_\varepsilon(A)$ be the number of elements
(presumed finite) in a maximal $\varepsilon$-distinguishable subset of $A$; then the
$\varepsilon$-capacity, $C_\varepsilon(A)$, is defined by

$$C_\varepsilon(A) = \log M_\varepsilon(A), \tag{13}$$

the logarithm being again taken to base two.* For a transmission
system, $C_\varepsilon(A)$ measures the number of distinguishable signals of the
source or of the processed signal at the output of the receiver depending
on the identification of $A$. The following inequalities hold between
$\varepsilon$-capacity and $\varepsilon$-entropy:

$$C_{2\varepsilon}(A) \le H_\varepsilon(A) \le C_\varepsilon(A). \tag{14}$$


To show this, consider the inequality on the right. Let $D$ be a maximal
$\varepsilon$-distinguishable subset of $A$; then $\varepsilon$-balls about each element of $D$
constitute an $\varepsilon$-covering of $A$ for, otherwise, there would be an $x \in A$
not covered and hence more than $\varepsilon$ away from every element of $D$. This
would contradict the maximality of $D$. For the inequality on the left,
let $D$ be a $2\varepsilon$-distinguishable subset of $A$; then the number of elements
of $D$ cannot exceed the number of covering sets of diameter $2\varepsilon$ or less
in an $\varepsilon$-covering of $A$ for, otherwise, there would be at least two elements
of $D$ in the same covering set. This would contradict the
$2\varepsilon$-distinguishability of $D$.
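The argument for the right-hand inequality of (14) can be tried out on a finite set. The following is a minimal sketch, not part of the original analysis: it assumes a hypothetical stand-in for $A$ (a grid of reals on $[0, 1]$ under the absolute-value norm) and builds an inextensible $\varepsilon$-distinguishable subset greedily. Such a subset is automatically an $\varepsilon$-net, so the logarithm of its size lies between the relative $\varepsilon$-entropy and the $\varepsilon$-capacity of the grid. The function names are illustrative only.

```python
import math

def greedy_distinguishable(points, eps):
    """Greedily build an inextensible eps-distinguishable subset of a finite set of reals."""
    chosen = []
    for p in points:
        # keep p only if it is more than eps away from every point kept so far
        if all(abs(p - q) > eps for q in chosen):
            chosen.append(p)
    return chosen

def is_eps_net(net, points, eps):
    """True if every point lies within eps of some element of net."""
    return all(any(abs(p - q) <= eps for q in net) for p in points)

# Hypothetical finite stand-in for a totally bounded set A: a grid on [0, 1].
A = [k / 1000 for k in range(1001)]
eps = 0.05
D = greedy_distinguishable(A, eps)

# D is eps-distinguishable and, by the maximality argument, also an eps-net of A,
# so log2(len(D)) is an upper bound on the relative eps-entropy of the grid
# and a lower bound on its eps-capacity.
bits = math.log2(len(D))
print(len(D), is_eps_net(D, A, eps), bits)
```

A greedy pass yields a subset that cannot be enlarged, which is all the covering argument uses; it need not have the largest possible cardinality.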

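It is possible to bound $H_\varepsilon^X(A)$ above and below in terms of $d_n^X(A)$
(refer to Ref. 2 where Mitjagin's inequalities are given). An improved
form of Mitjagin's upper bound is proved below.

Theorem 1: Let $A$ be a totally bounded subset of a real, normed, vector
space $X$. Let the nth widths relative to $X$ be $d_n^X(A)$, and let

$$N = \max\,[n : d_{n-1}^X(A) \ge (1 - \alpha)\varepsilon]$$

with $\alpha$ an arbitrary number satisfying $0 < \alpha < 1$; then

$$H_\varepsilon^X(A) < N \log\left(\frac{2 d_0^X(A)}{\alpha\varepsilon} + \frac{2 - \alpha}{\alpha}\right).$$

Proof: Let $E_N$ be an $N$-dimensional subspace of $X$ for which $E_{E_N}(A) <
(1 - \alpha)\varepsilon$; then for every $x \in A$ there is a $y \in E_N$ such that
$\|x - y\| < (1 - \alpha)\varepsilon$. Let $A_N$ be the
set of all such $y$ for every $x \in A$. An $\alpha\varepsilon$-net of $A_N$ is an $\varepsilon$-net of $A$; hence
$H_\varepsilon^X(A) \le H_{\alpha\varepsilon}^X(A_N) \le C_{\alpha\varepsilon}(A_N)$. Let $y_1, \dots, y_M$ be an $\alpha\varepsilon$-distinguishable
subset of $A_N$, and let $B_j \subset E_N$ be balls with centers $y_j$ and radius $\tfrac{1}{2}\alpha\varepsilon$;
then they are disjoint and are all contained in the ball $B$ with center
the origin and radius $d_0^X + (1 - \tfrac{1}{2}\alpha)\varepsilon$. Let $\lambda_N$ be the volume element
in $E_N$; then $\lambda_N M (\tfrac{1}{2}\alpha\varepsilon)^N \le \lambda_N [d_0^X + (1 - \tfrac{1}{2}\alpha)\varepsilon]^N$. The inequality of the
theorem follows on taking logarithms.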

The class of functions to be studied consists of the space $B_\sigma$ defined
by the conditions that $f(t) \in B_\sigma$ be analytically continuable into the
complex plane as an entire function of exponential order one and type $\sigma$,
and that it be bounded on the whole real axis $-\infty < t < \infty$. The following
inequality (Bernstein's inequality) is valid for $B_\sigma$:

$$\sup_{-\infty < t < \infty} |f'(t)| \le \sigma \sup_{-\infty < t < \infty} |f(t)|. \tag{15}$$

Important subspaces of the space $B_\sigma$ are the space $C_\sigma$ defined by

$$f \in C_\sigma \Leftrightarrow A(\eta) = o(e^{\sigma|\eta|}) \tag{16}$$

in which

$$A(\eta) = \sup_{\xi} |f(\xi + i\eta)|, \quad \xi, \eta \text{ real}, \tag{17}$$

and the space $W_\sigma$ defined by

$$f \in W_\sigma \Leftrightarrow f \in L^2(-\infty, \infty). \tag{18}$$


Several representations for $B_\sigma$ exist; however, the following
representations are needed for the present investigation. Let

$$\phi(t, \sigma) = \frac{\sin \sigma t}{\sigma t}, \tag{19}$$

$$\phi_j(t, \sigma) = \phi(t - jh, \sigma), \quad h = \pi/\sigma; \tag{20}$$

then one has the following:
Theorem 2:

$$f(t) \in C_\sigma \Leftrightarrow f(t) = \sum_{j=-\infty}^{\infty} f(jh) \phi_j(t, \sigma)$$

for all complex $t$. The series converges uniformly in every closed, bounded
region.
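Proof: Consider the integral

$$I_N = \frac{1}{2\pi i} \oint_{C_N} \frac{f(\zeta)}{(\zeta - t) \sin \sigma\zeta} \, d\zeta, \quad \zeta = \xi + i\eta, \tag{21}$$

taken over a square $C_N$ with corners at $(N + \tfrac{1}{2})(\pm 1 \pm i)h$, and $N$ so
large that $t$ is in the interior of the region bounded by $C_N$. The theorem
is clearly true when $t = kh$ ($k$ integral); it will hence be assumed $t \ne kh$
for any integral $k$. The index $N \ge 0$ is an integer. Evaluation of $I_N$ by
use of residues yields

$$f(t) = \sum_{j=-N}^{N} f(jh) \phi_j(t, \sigma) + I_N \sin \sigma t; \tag{22}$$

thus, to prove the implication to the right, it is sufficient to show $I_N \to 0$,
$N \to \infty$. Let $I_N^{(1)}$ be the integral (21) extended over that part of $C_N$ given
by $\xi = (N + \tfrac{1}{2})h$; then

$$|I_N^{(1)}| \le \frac{1}{2\pi} \int_{-(N+\frac{1}{2})h}^{(N+\frac{1}{2})h} \frac{A(\eta)}{|(N + \tfrac{1}{2})h + i\eta - t| \, |\sin(\pi(N + \tfrac{1}{2}) + i\sigma\eta)|} \, d\eta. \tag{23}$$

Since

$$|(N + \tfrac{1}{2})h + i\eta - t| \ge (N + \tfrac{1}{2})h - |t|, \qquad |\sin(\pi(N + \tfrac{1}{2}) + i\sigma\eta)| = \cosh \sigma\eta \ge \tfrac{1}{2} e^{\sigma|\eta|}, \tag{24}$$

one has

$$|I_N^{(1)}| \le \frac{1}{\pi} \, \frac{1}{(N + \tfrac{1}{2})h - |t|} \int_{-(N+\frac{1}{2})h}^{(N+\frac{1}{2})h} e^{-\sigma|\eta|} A(\eta) \, d\eta. \tag{25}$$

Writing equation (25) in the form

$$|I_N^{(1)}| \le \frac{2}{\pi} \, \frac{(2N + 1)h}{(2N + 1)h - 2|t|} \, \frac{1}{2(N + \tfrac{1}{2})h} \int_{-(N+\frac{1}{2})h}^{(N+\frac{1}{2})h} e^{-\sigma|\eta|} A(\eta) \, d\eta, \tag{26}$$

using equation (16) and the following lemma

$$f(\eta) \to 0, \ |\eta| \to \infty \ \Rightarrow \ \lim_{T \to \infty} \frac{1}{2T} \int_{-T}^{T} f(\eta) \, d\eta = 0, \tag{27}$$

shows that $I_N^{(1)} \to 0$ uniformly in $t$. The same conclusion applies to the
integral extended over $\xi = -(N + \tfrac{1}{2})h$.
Let $I_N^{(2)}$ be the integral (21) extended over $\eta = (N + \tfrac{1}{2})h$; then

$$|I_N^{(2)}| \le \frac{1}{2\pi} \int_{-(N+\frac{1}{2})h}^{(N+\frac{1}{2})h} \frac{|f(\xi + i(N + \tfrac{1}{2})h)|}{|\xi + i(N + \tfrac{1}{2})h - t| \, |\sin(\sigma\xi + i\pi(N + \tfrac{1}{2}))|} \, d\xi. \tag{28}$$

Since

$$|\xi + i(N + \tfrac{1}{2})h - t| \ge (N + \tfrac{1}{2})h - |t|, \qquad |\sin(\sigma\xi + i\pi(N + \tfrac{1}{2}))| \ge \tfrac{1}{4} e^{\sigma(N+\frac{1}{2})h}, \tag{29}$$

one has

$$|I_N^{(2)}| \le \frac{2}{\pi} \, \frac{(2N + 1)h}{(N + \tfrac{1}{2})h - |t|} \, e^{-\sigma(N+\frac{1}{2})h} A((N + \tfrac{1}{2})h). \tag{30}$$

In view of equation (16), $I_N^{(2)} \to 0$ uniformly in $t$. The same conclusion
applies to the integral extended over $\eta = -(N + \tfrac{1}{2})h$. For the implication
to the left, one may observe that $\phi_j(t, \sigma) \in C_\sigma$, and that the series
converges uniformly.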

The series of Theorem 2, which is clearly interpolatory, is called the
cardinal series.
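The interpolatory character of the cardinal series can be checked numerically. The sketch below is illustrative only: it assumes a particular test function, $f(t) = [\sin(\sigma t/2)/(\sigma t/2)]^2$, which is of type $\sigma$ and square integrable, and it truncates the series at a hypothetical number of terms. The helper names are not from the text.

```python
import math

def sinc(x):
    """sin(x)/x with the removable singularity filled in."""
    return 1.0 if x == 0.0 else math.sin(x) / x

def cardinal_series(f, sigma, t, n_terms):
    """Truncated cardinal series: sum over |j| <= n_terms of f(jh) * phi_j(t, sigma)."""
    h = math.pi / sigma
    return sum(f(j * h) * sinc(sigma * (t - j * h))
               for j in range(-n_terms, n_terms + 1))

sigma = 2.0
# Assumed test function: entire, of exponential type sigma, and in L^2.
f = lambda t: sinc(0.5 * sigma * t) ** 2

t0 = 0.37                      # a point that is not a sampling node
approx = cardinal_series(f, sigma, t0, 2000)
error = abs(approx - f(t0))
print(approx, f(t0), error)
```

The samples of this test function decay like $1/j^2$, so the truncated series converges quickly; for functions that are merely bounded the plain cardinal series may converge slowly or not at all, which is the motivation for the representation of Theorem 4.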


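For $f(t) \in L^2(-\infty, \infty)$, the Fourier transform relations are given by

$$F(u) = \frac{1}{(2\pi)^{1/2}} \int_{-\infty}^{\infty} e^{-iut} f(t) \, dt, \tag{31}$$

$$f(t) = \frac{1}{(2\pi)^{1/2}} \int_{-\infty}^{\infty} e^{iut} F(u) \, du. \tag{32}$$

The Fourier transform of $\phi_j(t, \sigma)$ is

$$\Phi_j(u, \sigma) = \frac{1}{\sigma} \left(\frac{\pi}{2}\right)^{1/2} e^{-iujh}, \quad |u| < \sigma; \qquad \Phi_j(u, \sigma) = 0, \quad |u| > \sigma. \tag{33}$$

The Parseval relation now shows that the sequence $\phi_j(t, \sigma)$, $-\infty < j < \infty$,
is orthogonal over $(-\infty, \infty)$ with respect to unit weight; thus,

$$\int_{-\infty}^{\infty} \phi_j(t, \sigma) \phi_k(t, \sigma) \, dt = 0, \quad j \ne k; \qquad = h, \quad j = k. \tag{34}$$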

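The following theorem may now be stated for $f \in W_\sigma$.
Theorem 3: $f \in W_\sigma$

$$\Rightarrow \int_{-\infty}^{\infty} |f(t)|^2 \, dt = h \sum_{j=-\infty}^{\infty} |f(jh)|^2,$$

$$f(t) = \frac{1}{(2\pi)^{1/2}} \int_{-\sigma}^{\sigma} e^{iut} F(u) \, du,$$

$$F(u) = \frac{h}{(2\pi)^{1/2}} \sum_{j=-\infty}^{\infty} f(jh) e^{-iujh}, \quad |u| < \sigma.$$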


Proof: The Paley-Wiener theorem shows that $f \in W_\sigma$ has the integral
representation given in Theorem 3; hence, by the Cauchy-Schwarz inequality

$$|f(\xi + i\eta)| \le \left(\frac{\sinh 2\sigma\eta}{2\pi\eta} \int_{-\sigma}^{\sigma} |F(u)|^2 \, du\right)^{1/2} = o(e^{\sigma|\eta|}). \tag{35}$$

Equation (35) shows that $W_\sigma \subset C_\sigma$; thus, by Theorem 2, $f$ is in the
closure of the system $\phi_j(t, \sigma)$, $-\infty < j < \infty$. The Parseval relation now
follows from equation (34). To establish the formula for $F(u)$, it is
only necessary to show

$$\int_{-\sigma}^{\sigma} e^{iut} \sum_{j=M}^{N} f(jh) e^{-iujh} \, du \to 0, \quad M, N \to \infty, \quad M, N \to -\infty, \tag{36}$$

because each term is the Fourier transform of the corresponding $\phi_j$ term
of the cardinal series. One has

$$\left|\int_{-\sigma}^{\sigma} e^{iut} \sum_{j=M}^{N} f(jh) e^{-iujh} \, du\right|^2 \le 2\sigma \int_{-\sigma}^{\sigma} \left|\sum_{j=M}^{N} f(jh) e^{-iujh}\right|^2 du, \tag{37}$$

$$\left|\int_{-\sigma}^{\sigma} e^{iut} \sum_{j=M}^{N} f(jh) e^{-iujh} \, du\right|^2 \le 4\sigma^2 \sum_{j=M}^{N} |f(jh)|^2 \to 0. \tag{38}$$

The limit zero is obtained as a consequence of the Parseval relation of
Theorem 3.
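To obtain a representation for the class $B_\sigma$, let

$$\theta(t) = \left[\frac{\sin \dfrac{\delta\sigma t}{(1 - \delta)m}}{\dfrac{\delta\sigma t}{(1 - \delta)m}}\right]^m \frac{\sin \dfrac{\sigma t}{1 - \delta}}{\dfrac{\sigma t}{1 - \delta}}, \tag{39}$$

$$m > 0 \text{ integral}, \quad 0 < \delta < 1,$$

$$\theta_j(t) = \theta(t - jh), \quad h = \frac{\pi}{\sigma}(1 - \delta); \tag{40}$$

then one has
Theorem 4: $f \in B_\sigma$

$$\Rightarrow f(t) = \sum_{j=-\infty}^{\infty} f(jh) \theta_j(t).$$

The series converges absolutely and uniformly in every closed, bounded
region.
Proof: The function

$$f(t) \left[\frac{\sin \dfrac{\delta\sigma(s - t)}{(1 - \delta)m}}{\dfrac{\delta\sigma(s - t)}{(1 - \delta)m}}\right]^m \tag{41}$$

belongs to $W_{\sigma/(1-\delta)}$ for each positive integer $m$ and arbitrary $s$, hence the
cardinal series applied to this function yields the expansion

$$f(t) \left[\frac{\sin \dfrac{\delta\sigma(s - t)}{(1 - \delta)m}}{\dfrac{\delta\sigma(s - t)}{(1 - \delta)m}}\right]^m = \sum_{j=-\infty}^{\infty} f(jh) \left[\frac{\sin \dfrac{\delta\sigma(s - jh)}{(1 - \delta)m}}{\dfrac{\delta\sigma(s - jh)}{(1 - \delta)m}}\right]^m \frac{\sin \dfrac{\sigma(t - jh)}{1 - \delta}}{\dfrac{\sigma(t - jh)}{1 - \delta}}. \tag{42}$$

Let $s = t$; then the required representation is obtained. The absolute
convergence follows from the boundedness of $|f(jh)|$ and

$$|\theta_j| = O(|j|^{-m-1}). \tag{43}$$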
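The effect of the guard band can be seen on a function that is bounded but not square integrable. The following sketch is an illustration under stated assumptions, not the author's computation: the test function, the values $\sigma = \pi$, $\delta = 0.5$, $m = 1$, and the truncation index are all hypothetical choices. The truncation error is compared with the tail estimate $M \cdot \frac{2}{\pi m} \left(\frac{m}{\pi\delta}\right)^m (N + \tfrac{1}{2} - |t|/h)^{-m}$, which follows from $|\sin| \le 1$ in the kernel together with an integral comparison of the tail sum.

```python
import math

def sinc(x):
    """sin(x)/x with the removable singularity filled in."""
    return 1.0 if x == 0.0 else math.sin(x) / x

def theta(t, sigma, delta, m):
    """Kernel with guard band delta: an m-th power smoothing factor times a sinc."""
    a = sigma / (1.0 - delta)              # enlarged band limit, a * h = pi
    return sinc(delta * a * t / m) ** m * sinc(a * t)

def guard_band_series(f, sigma, delta, m, t, n_terms):
    """Truncated series: sum over |j| <= n_terms of f(jh) * theta_j(t)."""
    h = math.pi * (1.0 - delta) / sigma
    return sum(f(j * h) * theta(t - j * h, sigma, delta, m)
               for j in range(-n_terms, n_terms + 1))

sigma, delta, m, N = math.pi, 0.5, 1, 200
# Assumed test function: bounded, of type at most sigma, but not in L^2.
f = lambda t: math.cos(0.8 * sigma * t) + 0.5 * math.sin(0.3 * sigma * t)
M = 1.5                                     # bound on |f| over the real axis

t0 = 0.3
h = math.pi * (1.0 - delta) / sigma
error = abs(guard_band_series(f, sigma, delta, m, t0, N) - f(t0))
tail_bound = (M * (2.0 / (math.pi * m)) * (m / (math.pi * delta)) ** m
              / (N + 0.5 - abs(t0) / h) ** m)
print(error, tail_bound)
```

Larger values of $m$ give faster decay of the kernel at the cost of a larger constant, which is the trade-off exploited in the width estimates that follow.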


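Approximation will be studied in the uniform norm and the following
Sobolev norm

$$\|f\|_s = \left(\int_{-T/2}^{T/2} \left[|f(t)|^2 + \mu_1 |f'(t)|^2 + \cdots + \mu_s |f^{(s)}(t)|^2\right] dt\right)^{1/2} \tag{44}$$

in which $\mu_1, \dots, \mu_s$ are positive numbers. For the space $B_\sigma$, the symbol
$B_{\sigma,s}^T$ will be used for the corresponding normed space. The symbol
$B_{\sigma,s}^T(M)$ will be used for the subset defined by $|f(t)| \le M$, $-\infty < t < \infty$.
For the space $W_\sigma$, the corresponding normed space will be denoted by
$W_{\sigma,s}^T$, and $W_{\sigma,s}^T(B)$ for the subset in which

$$\left(\int_{-\infty}^{\infty} |f(t)|^2 \, dt\right)^{1/2} \le B. \tag{45}$$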


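III. THEORETICAL INVESTIGATION OF $B_\sigma$

Let $B_\sigma^T$ designate the vector space $B_\sigma$ normed by

$$\|f\|_C = \max_{-T/2 \le t \le T/2} |f(t)| \tag{46}$$

and let $B_\sigma^T(M)$ be the subset of $B_\sigma^T$ satisfying

$$|f(t)| \le M, \quad -\infty < t < \infty. \tag{47}$$

The completion of $B_\sigma^T$ is the space $C^T$ of functions continuous over
$[-T/2, T/2]$ and normed by equation (46).
Let

$$\delta_n = 1 - \left(\frac{\sigma T}{\pi(n - 1)}\right)^{1/2}, \quad n - 1 > \frac{\sigma T}{\pi}, \tag{48}$$

$$m = \left[\frac{\pi}{2e} \delta_n^2 (n - 1)\right], \quad m \ge 1;$$

then the following theorem provides a bound on the nth width,
$d_n^{C^T}(B_\sigma^T(M))$, of $B_\sigma^T(M)$ relative to the space $C^T$.

Theorem 5: $d_n^{C^T}(B_\sigma^T(M)) \le (2M/\pi m) e^{-m}$.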

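Proof: The series representation of Theorem 4 will be used. The function

$$g(t) = \sum_{j=-N}^{N} f(jh) \theta_j(t) \tag{49}$$

establishes an approximation to $f(t)$ whose error is given by

$$f(t) - g(t) = \sum_{|j| > N} f(jh) \theta_j(t). \tag{50}$$

From equations (39) and (40), one has

$$|\theta_j(t)| \le \frac{1}{\pi} \left(\frac{m}{\pi\delta}\right)^m \frac{1}{\left(|j| - \dfrac{T}{2h}\right)^{m+1}}, \quad |j| > \frac{T}{2h}, \quad |t| \le \frac{T}{2}. \tag{51}$$

Define the function $p(x)$ by

$$p(x) = \tfrac{1}{2} - x, \quad 0 \le x < 1; \qquad p(x) = p(x + 1) \text{ for all } x, \tag{52}$$

then the Sonin (Euler-Maclaurin) summation formula is

$$\sum_{a < j \le b} \psi(j) = \int_a^b \psi(x) \, dx + p(b)\psi(b) - p(a)\psi(a) - \int_a^b p(x) \psi'(x) \, dx \tag{53}$$

in which $a < b$ are arbitrary numbers. Use of equation (53) with

$$\psi(x) = \frac{1}{\pi} \left(\frac{m}{\pi\delta}\right)^m \frac{1}{\left(x - \dfrac{T}{2h}\right)^{m+1}}, \quad a = N + \tfrac{1}{2}, \quad b = \infty \tag{54}$$

yields

$$\sum_{|j| > N} |\theta_j(t)| \le \frac{2}{\pi m} \left(\frac{m}{\pi\delta}\right)^m \frac{1}{\left(N + \tfrac{1}{2} - \dfrac{T}{2h}\right)^m}. \tag{55}$$

Let

$$m = \left[\frac{\pi\delta}{e} \left(N + \tfrac{1}{2} - \frac{T}{2h}\right)\right] \ge 1, \tag{56}$$

then

$$\sum_{|j| > N} |\theta_j(t)| \le \frac{2}{\pi m} e^{-m}. \tag{57}$$

Thus, from equation (50), one obtains

$$\|f - g\|_C \le \frac{2M}{\pi m} e^{-m}, \tag{58}$$

and hence

$$d_{2N+1}^{C^T}(B_\sigma^T(M)) \le \frac{2M}{\pi m} e^{-m}. \tag{59}$$

For $n$ odd, one has

$$d_n^{C^T}(B_\sigma^T(M)) \le \frac{2M}{\pi} \, \frac{\exp\left\{-\left[\dfrac{\pi\delta}{2e}\left(n - \dfrac{T}{h}\right)\right]\right\}}{\left[\dfrac{\pi\delta}{2e}\left(n - \dfrac{T}{h}\right)\right]} \tag{60}$$

while if $n$ is even, one has $d_n^{C^T} \le d_{n-1}^{C^T}$; hence, in all cases

$$d_n^{C^T}(B_\sigma^T(M)) \le \frac{2M}{\pi} \, \frac{\exp\left\{-\left[\dfrac{\pi\delta}{2e}\left(n - 1 - \dfrac{T}{h}\right)\right]\right\}}{\left[\dfrac{\pi\delta}{2e}\left(n - 1 - \dfrac{T}{h}\right)\right]}. \tag{61}$$

The fractional guardband $\delta$ is now chosen as in equation (48) from which
the inequality of the theorem follows.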
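The width bound of Theorem 5 is simple to evaluate once the guard band is fixed. The sketch below is a hedged illustration: it assumes the reading $\delta_n = 1 - (\sigma T / \pi(n-1))^{1/2}$ and $m = [(\pi/2e)\,\delta_n^2 (n-1)]$ for equation (48), which is the choice of $\delta$ that maximizes the argument of the integral part in the bound for general $n$. The function name and the sample values are hypothetical.

```python
import math

def theorem5_bound(n, sigma_T, M):
    """Return (delta_n, m, bound) with bound = (2M / (pi m)) * exp(-m).

    Assumes delta_n = 1 - sqrt(sigma_T / (pi (n - 1))) and
    m = floor((pi / (2 e)) * delta_n**2 * (n - 1)), with m >= 1 required.
    """
    ratio = sigma_T / (math.pi * (n - 1))
    if ratio >= 1.0:
        raise ValueError("need n - 1 > sigma*T/pi")
    delta = 1.0 - math.sqrt(ratio)
    m = math.floor(math.pi / (2.0 * math.e) * delta ** 2 * (n - 1))
    if m < 1:
        raise ValueError("n too small: m must be at least 1")
    return delta, m, 2.0 * M / (math.pi * m) * math.exp(-m)

# Hypothetical example: sigma*T/pi = 20 (about 20 Nyquist samples in the interval).
delta, m, bound = theorem5_bound(n=200, sigma_T=20 * math.pi, M=1.0)
print(delta, m, bound)
```

Under this reading the bound decays exponentially once $n$ exceeds $\sigma T/\pi$ by a comfortable margin, and it is vacuous (the function raises an error) when $n - 1 \le \sigma T/\pi$.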

When $n$ is large, a more accurate estimate of $d_n^{C^T}$ may be obtained by
using a polynomial approximation to $B_\sigma^T$. Let

$$g(x) = f\left(\frac{T}{2} x\right), \tag{62}$$

and let $L(x)$ be the Lagrange interpolation polynomial established for
$g(x)$ on the zeros of $T_n(x)$, the nth Tchebysheff polynomial of first kind,
over $[-1, 1]$; then the standard error formula for Lagrange interpolation
yields

$$\max_{-1 \le x \le 1} |g(x) - L(x)| \le \frac{1}{n! \, 2^{n-1}} \max_{-1 \le x \le 1} |g^{(n)}(x)|. \tag{63}$$

Bernstein's inequality (15) and equation (62) now yield

$$\max_{-1 \le x \le 1} |g(x) - L(x)| \le \frac{2M}{n!} \left(\frac{\sigma T}{4}\right)^n; \tag{64}$$

hence one has

Theorem 6:

$$d_n^{C^T}(B_\sigma^T(M)) \le \frac{2M}{n!} \left(\frac{\sigma T}{4}\right)^n.$$
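The polynomial estimate can be checked directly by interpolating a band-limited function at the Tchebysheff zeros. The sketch below is illustrative: the test function $f(t) = \cos \sigma t$ and the values $\sigma = 2$, $T = 2$, $n = 10$ are assumptions, and the bound is taken in the form $(2M/n!)(\sigma T/4)^n$, which is what equation (63) gives when the $n$th derivative of $g$ is bounded by $M(\sigma T/2)^n$.

```python
import math

def chebyshev_nodes(n):
    """Zeros of the n-th Tchebysheff polynomial of the first kind on [-1, 1]."""
    return [math.cos((2 * k + 1) * math.pi / (2 * n)) for k in range(n)]

def lagrange_eval(nodes, values, x):
    """Evaluate the Lagrange interpolation polynomial through (nodes, values) at x."""
    total = 0.0
    for i, xi in enumerate(nodes):
        term = values[i]
        for j, xj in enumerate(nodes):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total

sigma, T, M, n = 2.0, 2.0, 1.0, 10
f = lambda t: math.cos(sigma * t)          # assumed test function, |f| <= M = 1
g = lambda x: f(0.5 * T * x)               # change of variable t = (T/2) x

nodes = chebyshev_nodes(n)
values = [g(x) for x in nodes]
grid = [-1.0 + 2.0 * k / 400 for k in range(401)]
max_error = max(abs(g(x) - lagrange_eval(nodes, values, x)) for x in grid)
bound = 2.0 * M / math.factorial(n) * (sigma * T / 4.0) ** n
print(max_error, bound)
```

Because of the factorial in the denominator, this estimate overtakes the exponential estimate of Theorem 5 once $n$ is large compared with $\sigma T$.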


Let $H_s^T$ be the space of functions $f(t)$ possessing derivatives up to
order $s$ satisfying $f, f', \dots, f^{(s)} \in L^2(-T/2, T/2)$ and normed by
equation (44); then Theorem 7 provides an estimate of the nth width of
$B_{\sigma,s}^T(M)$ relative to $H_s^T$.


Theorem 7: Let 





s—] \ 
= ber ee 
T= ‘Se TPs =f = 1)! [? (2s =, Op 1) (s <— r) + as »} 3 
in which po = 1 and the sum is considered zero when s = 0, then 


MT T*(2c)"** 
nt (*"\en + 1) 


Proof: For the function g(x) of equation ie the identity 


di:)(Bz..(M)) Ss 





ge) = P@) + [ Sag gu) du, (65) 


(in which $P(x)$ is a polynomial of degree not exceeding $s - 1$), will be
used to obtain a polynomial approximation to $g(x)$ in the Sobolev norm
(44). Let $L(x)$ be the Lagrange interpolation polynomial for $g^{(s)}(x)$
formed with $n$ nodal points on $[-1, 1]$ and $w(x)$ the corresponding
fundamental polynomial; then one has


s il nts 
g(r) = L@) +7 9" Oo@), Fe [-1, I). (66) 
The polynomial J(z) defined by 


igs Pay’ Ls el RG ee (67) 


= 


will be used to approximate g(x) in the Sobolev norm; its degree does 
not exceed n + s — 1. Let 


oo Gy sat, bes (68) 
then, from equation (66), one has 


lg? @) — P'@) | 


< tase [ Gayle | du, Sr <3 (69) 


| 9a) — 1) | s Hee | (a) |. (70) 
The norm (44) for the interval $[-1, 1]$ may be written

$$\|g - J\|_s^2 = \int_{-1}^{1} \sum_{r=0}^{s} \nu_r \, |g^{(r)}(x) - J^{(r)}(x)|^2 \, dx, \tag{71}$$

in which $\nu_0, \dots, \nu_s \ge 0$; the $\nu_r$ and $\mu_r$ are related through the change
of variable $t = (T/2)x$. Using equations (69) and (70), one has
Mise 
We ieee oe) re" aie 
1 s—1 x s—r-1 2 
(e@—4 ) ‘ 
a p> »A heresy, | w(u) | du} + v,w(x)"¢ dx. (72) 
Define the function k(u, v) by 
1 
ke, v= id ea) ay de (73) 
max (u,v) 
then equation (72) may be written: 
2 s—l 
_7\Ppe< Mase} pete 
et lt Se ee ae 
1 1 1 
| / k(u, v) | o&)wv) | du dv + », / w(a)” ar (74) 
-1 -1 cao 
The Cauchy-Schwartz inequality shows that 
(d — uF — a)? | 
k(u, v) = prs _ or = 1 d (75) 
hence, 
2 MM, <= Vv 
= < stints r 
II 9 Tl, 3“ Oe Eerie op 
(f (l— ut"? | w(u) | au) +», / w(x)” dnp. (76) 
Further application of the Cauchy—Schwartz inequality yields 
Mass 
ied |S eee 
s—1 v Daraae 1 ‘ 
1% Qe —r— DF @s—a#—-De-n* »p iz oD eas 00) 


A good choice for $w(x)$ is

$$w(x) = \frac{2^n}{\binom{2n}{n}} P_n(x) \tag{78}$$

where $P_n(x)$ is the nth Legendre polynomial. The coefficient of $P_n(x)$
in equation (78) makes $w(x)$ monic. Since

$$\int_{-1}^{1} P_n(x)^2 \, dx = \frac{2}{2n + 1}, \tag{79}$$

one obtains


91h ey Ges) 


eat Me aed i 
p> 16 = Ff Dl =a Dean »} - (80) 


The Bernstein inequality (15) shows that 








Mise & Mc"; (81) 
hence 
M2%c"*s 2 ) 
llg-Ill.s b=5 


2n 
“() 
ee eee nh 
\24o¢@—r-DP@—-a#—-Dpe-n sf © 


Finally the change of variable $t = (T/2)x$ and the replacement of $\nu_r$ by
the original $\mu_r$ yield the result of the theorem.

The results of Theorems 5, 6, and 7 may be translated into estimates
of entropy by use of the Mitjagin inequality of Theorem 1. The estimates
so obtained will apply only to the subset of $B_\sigma(M)$ for which $f(t)$ is real.
Doubling the bounds will provide estimates for complex valued $f(t)$.


Theorem 8: LetO0<a<1,(1 — ade < (2M/ze), f(t) real, 


2M 
ess m(1 — aye ; 
(1 — ae 2M 2M : 
bee m1 — ae a m m1 — aye 


2M 
m =|In 
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a@iany 3 {2+ [2 (1+ (Eo +n) [prow CM + 
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a= a), 





Proof: According to Theorems 1 and 5, one must solve the inequality 


gs e” = (1 — ae 
7m 
for the largest integer m; thus 


me” < ery : 
Consider the function 

F(z) = 6—2x-— Inga, 6> 1. 
One has 


Te oe 
F(z) = -1 me 


hence, by the mean value theorem, 


1 


Ps —m) = —mns +a(L +4), eet ee ee 


g 
Let 
F(é — h) = 0; 


then, since f is positive, 





o<-mstni+—4), 


0<—é6Iné6+A01+6+4+1n 5) — fh’; 


thus 
6 In 6 
aa, Geer oa PET, 
and 
1+ 6 
a ee Ben aN ig Far 


The inequality 
o-PpiIn gs 6 


(83) 


(84) 


(85) 


(86) 


(87) 


(88) 


(89) 


(90) 


(91) 


(92) 


(98) 


1926 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1970 


is thus satisfied by 


1+ 6 ; 
seceoes GSI NG (94) 
hence, setting 
2M 
6 = In ed sae (95) 
and taking cognizance of the integral character of m, one has 
ide 
in 2M m(1 — ae (96) 
Wea ME Oe pg ye OM ns ee 
m(1 — ae m1 — ae] 
provided 
Game =. (97) 
Te 
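For the computation of $d_{n-1}$, one has from equation (48)

$$m = \left[\frac{\pi}{2e} \delta_{n-1}^2 (n - 2)\right]. \tag{98}$$

Hence

$$\frac{\pi}{2e} \delta_{n-1}^2 (n - 2) < m + 1, \tag{99}$$

$$\left\{1 - \left(\frac{\sigma T}{\pi(n - 2)}\right)^{1/2}\right\}^2 (n - 2) < \frac{2e}{\pi}(m + 1). \tag{100}$$

Let

$$\nu = \frac{\pi}{\sigma T}(n - 2); \tag{101}$$

then

$$\nu \left(1 - \frac{1}{\nu^{1/2}}\right)^2 < \frac{2e}{\sigma T}(m + 1); \tag{102}$$

accordingly

$$\nu < \left\{1 + \left(\frac{2e}{\sigma T}(m + 1)\right)^{1/2}\right\}^2. \tag{103}$$

Thus $n$ satisfies

$$n \le 2 + \frac{\sigma T}{\pi} \left\{1 + \left(\frac{2e}{\sigma T}(m + 1)\right)^{1/2}\right\}^2; \tag{104}$$

the theorem now follows from equations (96), (97), (104), and Theorem 1.
When $\varepsilon$ is small, a more accurate estimate of entropy may be obtained
by use of Theorem 6 in place of Theorem 5. Accordingly one has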


Theoren 9: LetO<a<1,n = Q2M/(1 — a) e(ec)*), 


sls “a 3 
n = max (t ,é F f(t) real; 
then 
H (Bz(M)) 
| 2Ing +$—4$in(2 Ina) || 


9M . 2— 
_ 9 1 Hoe (SE + 2). 
in (2m) 41+ 50 J 





Proof: According to Theorem 6, one may consider 


Stirling's formula provides the inequality

$$n! > n^n e^{-n} (2\pi n)^{1/2}, \tag{106}$$


and hence one may consider 


2M - 
(nn)! (=.) = (1 — ae. (107) 
Let 
ec 2M 
oes 9% oe, (1 — ae(rec)? , vee) 
then equation (107) becomes 
gt *itee < ne’, (109) 


Consider the function 


F(z) = 6— («+ a) lng; (110) 
then, by the mean value theorem


oe a) _W/(l_ a 
F(6 — h) = 6 (@+ams+hns+1+9) (3 ‘), 


Shee 6. 019) 








Let 
zx=6-h2a, F(6 — h) = 0; (112) 
then 
0S5—-(+a)msthin(sti+§). (113) 
Let 
6 = l/e; (114) 
then 
ecg asa Eee (115) 
Iné+1+ ; 
and hence 
2 < 2ita-aln 5 (116) 
né+1+5 
Thus, in terms of n and 7, one has 
2Ing-+$—4in(2 nn) 
ns 9 1 (117) 
in (2. m ») ze © Olag 


The lower bound on 7 in the theorem assures the satisfaction of the 
conditions on z and 6 in equations (112) and (114). Use of Theorem 1 
now provides the inequality of the theorem. 

Theorem 10 provides an entropy estimate deduced from the width 
result of Theorem 7. 


Theorem 10: 0<a< 1,7 = (MI(2c)*/(1 — a)e(ec)*) 
n 2 max (2 ’ ”) ’ : dl he io oa jg), {() real; 


then 
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A (Bz, (M)) 





2 
ny+ 3 g in oe (aT, 2—<). 


<jsti+ 9 1 log 
| in (Ztnn) +1 +52-|| 


Qe a 





Proof: The investigation parallels that of Theorem 9. A difference 
occurs in the estimation of d, . The Bernstein inequality of (15) shows 
that 


do S MyT’; (118) 


hence the estimate of the theorem. 

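The case $m = 1$ of the representation given in Theorem 4 may be used
to obtain an explicit $\varepsilon$-net for $B_\sigma(M)$, and hence to provide a constructive
algorithm for the transmission of information from such a source. The
representation for $f(t) \in B_\sigma(M)$ takes the form

$$f(t) = \sum_{j=-\infty}^{\infty} f(jh) \, \frac{\sin \dfrac{\delta\sigma}{1 - \delta}(t - jh)}{\dfrac{\delta\sigma}{1 - \delta}(t - jh)} \cdot \frac{\sin \dfrac{\sigma}{1 - \delta}(t - jh)}{\dfrac{\sigma}{1 - \delta}(t - jh)}, \quad \sigma h = \pi(1 - \delta). \tag{119}$$

In order to proceed, it is necessary to estimate the quantity $\Lambda(\delta)$
given by

$$\Lambda(\delta) = \sup_{-\infty < t < \infty} \sum_{j=-\infty}^{\infty} \left|\frac{\sin \dfrac{\delta\sigma}{1 - \delta}(t - jh)}{\dfrac{\delta\sigma}{1 - \delta}(t - jh)} \cdot \frac{\sin \dfrac{\sigma}{1 - \delta}(t - jh)}{\dfrac{\sigma}{1 - \delta}(t - jh)}\right|. \tag{120}$$

Theorem 11: $\Lambda(\delta) \le 1/\delta^{1/2}$ for $0 < \delta < 1$.
Proof: The Cauchy-Schwarz inequality yields

$$\Lambda(\delta) \le \sup_{-\infty < t < \infty} \left(\sum_{j=-\infty}^{\infty} \left[\frac{\sin \dfrac{\delta\sigma}{1 - \delta}(t - jh)}{\dfrac{\delta\sigma}{1 - \delta}(t - jh)}\right]^2\right)^{1/2} \sup_{-\infty < t < \infty} \left(\sum_{j=-\infty}^{\infty} \left[\frac{\sin \dfrac{\sigma}{1 - \delta}(t - jh)}{\dfrac{\sigma}{1 - \delta}(t - jh)}\right]^2\right)^{1/2}. \tag{121}$$

From the Parseval relation of Theorem 3, one has

$$\sum_{j=-\infty}^{\infty} \left[\frac{\sin \dfrac{\sigma}{1 - \delta}(t - jh)}{\dfrac{\sigma}{1 - \delta}(t - jh)}\right]^2 = \frac{1}{h} \int_{-\infty}^{\infty} \left[\frac{\sin \dfrac{\sigma}{1 - \delta}(t - s)}{\dfrac{\sigma}{1 - \delta}(t - s)}\right]^2 ds = 1, \tag{122}$$

$$\sum_{j=-\infty}^{\infty} \left[\frac{\sin \dfrac{\delta\sigma}{1 - \delta}(t - jh)}{\dfrac{\delta\sigma}{1 - \delta}(t - jh)}\right]^2 = \frac{1}{h} \int_{-\infty}^{\infty} \left[\frac{\sin \dfrac{\delta\sigma}{1 - \delta}(t - s)}{\dfrac{\delta\sigma}{1 - \delta}(t - s)}\right]^2 ds = \frac{1}{\delta}. \tag{123}$$

The theorem is established.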
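The quantity $\Lambda(\delta)$ can be estimated numerically and compared with the bound $\delta^{-1/2}$ that the Cauchy-Schwarz and Parseval steps give. The sketch below is illustrative: it normalizes $h = 1$ (so that $\sigma/(1-\delta) = \pi$), truncates the sum at a hypothetical number of terms, and samples $t$ over one sampling interval, which suffices because the sum is periodic in $t$ with period $h$. A truncated sum of nonnegative terms can only underestimate the full sum.

```python
import math

def sinc(x):
    """sin(x)/x with the removable singularity filled in."""
    return 1.0 if x == 0.0 else math.sin(x) / x

def kernel_abs_sum(t, delta, n_terms):
    """Truncated sum over |j| <= n_terms of |theta_j(t)| for m = 1, with h = 1."""
    total = 0.0
    for j in range(-n_terms, n_terms + 1):
        x = math.pi * (t - j)      # (sigma / (1 - delta)) * (t - j h) when h = 1
        total += abs(sinc(delta * x) * sinc(x))
    return total

delta = 0.25
samples = [k / 50 for k in range(51)]            # 0 <= t <= h = 1
lam_estimate = max(kernel_abs_sum(t, delta, 2000) for t in samples)
lam_bound = delta ** -0.5
print(lam_estimate, lam_bound)
```

At a sampling node only the $j = 0$ term survives and the sum equals one; the largest values occur near the midpoints between nodes.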
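Let

$$S = \sup_j |f(jh)|; \tag{124}$$

then a corollary to Theorem 11 is

Corollary:

$$\sup_{-\infty < t < \infty} |f(t)| \le S/\delta^{1/2}.$$

Proof: From equations (119) and (120), one has

$$\sup_{-\infty < t < \infty} |f(t)| \le S \Lambda(\delta). \tag{125}$$

The result follows from Theorem 11.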
The function 
sin 5 "2 (t — jh) sin 5 = 5 6b — ah) 
10 ieee 2 amar : (126) 
: (=m) SS it 

















constitutes an approximation to f(¢). The error may be assessed by 
application of equation (51) for m = 1, and Sonin’s formula (53); thus 


Ag Pye: 1 T\" 
i- oth s3y((W4+5-a) +(v+5+q) ) a2 
For 0 < a < 1, let 


v= [ghar tt CSSir)}) a] +5 es 


then direct verification establishes 


lig lle<-2 = ede (129) 
It may be observed that for large $\sigma$, one has

$$N \approx \frac{\sigma T}{2\pi(1 - \delta)} = \frac{T}{2h}; \tag{130}$$

that is, $N$ is approximately the number of nodal points $jh$ in $(-T/2, T/2)$.


Let

$$\beta_j(f) = \left[\frac{\Lambda(\delta) f(jh)}{2\alpha\varepsilon}\right] \tag{131}$$

and

$$\beta(f) = (\beta_{-N}(f), \dots, \beta_N(f)); \tag{132}$$

then the set $U_\beta$ is defined to consist of all $f(t)$ generating the same vector
$\beta = \beta(f)$. It will now be shown that the diameter of $U_\beta$ does not exceed
$2\varepsilon$. Let $f_1(t), f_2(t) \in U_\beta$; then

$$|f_1(jh) - f_2(jh)| \le \frac{2\alpha\varepsilon}{\Lambda(\delta)}. \tag{133}$$

One has

$$f_1(t) - f_2(t) = \sum_{j=-\infty}^{\infty} (f_1(jh) - f_2(jh)) \, \theta_j(t), \tag{134}$$

in which $\theta_j(t)$ denotes the kernel of equation (119), and hence, by
equation (129),

$$|f_1(t) - f_2(t)| \le \sum_{|j| \le N} |f_1(jh) - f_2(jh)| \, |\theta_j(t)| + 2(1 - \alpha)\varepsilon, \quad -\frac{T}{2} \le t \le \frac{T}{2}, \tag{135}$$

in which $N$ is chosen as in equation (128). From equation (133), one has

$$|f_1(t) - f_2(t)| \le \frac{2\alpha\varepsilon}{\Lambda(\delta)} \sum_{|j| \le N} |\theta_j(t)| + 2(1 - \alpha)\varepsilon. \tag{136}$$

Use of equation (120) shows that

$$\|f_1 - f_2\|_C \le 2\varepsilon. \tag{137}$$


The sets $U_\beta$ are centerable with respect to themselves; that is, there
exists an element $g(t) \in U_\beta$ whose distance from any other element of $U_\beta$
does not exceed $\varepsilon$. Consider the function $g(t)$ defined by

$$g(t) = \sum_{|j| \le N} \frac{2\alpha\varepsilon}{\Lambda(\delta)} \left(\beta_j + \tfrac{1}{2}\right) \theta_j(t), \tag{138}$$

in which $\theta_j(t)$ denotes the kernel of equation (119). One has

$$|f(jh) - g(jh)| \le \frac{\alpha\varepsilon}{\Lambda(\delta)}, \quad |j| \le N, \tag{139}$$

and

$$f(t) - g(t) = \sum_{|j| \le N} (f(jh) - g(jh)) \, \theta_j(t) + \sum_{|j| > N} f(jh) \, \theta_j(t); \tag{140}$$

hence, by equations (120), (129), and (139)

$$\|f - g\|_C \le \varepsilon. \tag{141}$$

The required constructive algorithm, $\Gamma$, is thus given by the mapping
$f \to g$ in equations (131) and (138).
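Theorem 12: $V(\Gamma) = (2N + 1) \log \{[\Lambda(\delta)M/2\alpha\varepsilon] - [-\Lambda(\delta)M/2\alpha\varepsilon] + 1\}$,
in which $N$ is given in equation (128).
Proof: It is necessary to enumerate the number of distinct $g(t)$ which
are generated by $\Gamma(B_\sigma(M))$. Since

$$\frac{\Lambda(\delta) |f(jh)|}{2\alpha\varepsilon} \le \frac{\Lambda(\delta) M}{2\alpha\varepsilon}, \tag{142}$$

the number of distinct values of $\beta_j(f)$ is

$$\left[\frac{\Lambda(\delta) M}{2\alpha\varepsilon}\right] - \left[-\frac{\Lambda(\delta) M}{2\alpha\varepsilon}\right] + 1 \tag{143}$$

and hence the number of distinct vectors $\beta(f)$ is

$$\left\{\left[\frac{\Lambda(\delta) M}{2\alpha\varepsilon}\right] - \left[-\frac{\Lambda(\delta) M}{2\alpha\varepsilon}\right] + 1\right\}^{2N+1}. \tag{144}$$

The theorem follows from equation (144).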
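The coding step of the algorithm $\Gamma$ is a uniform quantizer applied to the $2N + 1$ retained samples. The sketch below is a minimal illustration under stated assumptions: it takes $\beta_j(f)$ to be the integral part of $\Lambda(\delta) f(jh)/2\alpha\varepsilon$, reconstructs each sample at the midpoint of its cell (an assumed reading of equation (138)), and counts levels as in equation (143). The sample values and parameters are hypothetical.

```python
import math

def quantize(sample, lam, alpha, eps):
    """beta_j(f): integral part of lam * f(jh) / (2 * alpha * eps)."""
    return math.floor(lam * sample / (2.0 * alpha * eps))

def dequantize(beta, lam, alpha, eps):
    """Midpoint of the quantization cell selected by beta (assumed reconstruction rule)."""
    return (2.0 * alpha * eps / lam) * (beta + 0.5)

def volume_bits(n_half, lam, big_m, alpha, eps):
    """V = (2N + 1) * log2(levels), levels = [x] - [-x] + 1 with x = lam * M / (2 alpha eps)."""
    x = lam * big_m / (2.0 * alpha * eps)
    levels = math.floor(x) - math.floor(-x) + 1
    return (2 * n_half + 1) * math.log2(levels), levels

# Hypothetical parameters: Lambda = 2 (the bound for delta = 1/4), M = 1.
lam, big_m, alpha, eps, n_half = 2.0, 1.0, 0.5, 0.125, 10
samples = [big_m * math.sin(0.7 * j) for j in range(-n_half, n_half + 1)]

codes = [quantize(s, lam, alpha, eps) for s in samples]
rebuilt = [dequantize(b, lam, alpha, eps) for b in codes]
worst = max(abs(s - r) for s, r in zip(samples, rebuilt))
bits, levels = volume_bits(n_half, lam, big_m, alpha, eps)
print(levels, bits, worst, alpha * eps / lam)
```

Each sample is reproduced to within $\alpha\varepsilon/\Lambda(\delta)$, which after summation against the kernel contributes at most $\alpha\varepsilon$ to the uniform error; the remaining $(1 - \alpha)\varepsilon$ is the truncation allowance.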


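Corollary 1: $V(\Gamma) \le (2N + 1) \log ([M/2\alpha\varepsilon\delta^{1/2}] - [-M/2\alpha\varepsilon\delta^{1/2}] + 1)$.
Proof: Theorem 11.

Corollary 2: $V(\Gamma) < (2N + 1) \log (M/\alpha\varepsilon\delta^{1/2} + 2)$.
Proof: Corollary 1 and the inequalities

$$\left[\frac{M}{2\alpha\varepsilon\delta^{1/2}}\right] \le \frac{M}{2\alpha\varepsilon\delta^{1/2}}, \qquad -\left[-\frac{M}{2\alpha\varepsilon\delta^{1/2}}\right] < \frac{M}{2\alpha\varepsilon\delta^{1/2}} + 1. \tag{145}$$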


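IV. THEORETICAL INVESTIGATION OF $W_\sigma$

Using Theorem 3 for $f, g \in W_\sigma$, the Sobolev inner product

$$(f, g)_s = \int_{-T/2}^{T/2} \left(f \bar{g} + \mu_1 f' \bar{g}' + \cdots + \mu_s f^{(s)} \bar{g}^{(s)}\right) dt \tag{146}$$

takes the form

$$(f, g)_s = \int_{-\sigma}^{\sigma} \int_{-\sigma}^{\sigma} \frac{\sin \frac{T}{2}(u - v)}{\pi(u - v)} \, (1 + \mu_1 uv + \cdots + \mu_s u^s v^s) \, F(u) \bar{G}(v) \, du \, dv, \tag{147}$$

in which $F(u)$, $G(u)$ are the Fourier transforms of $f$, $g$ respectively. The
corresponding positive definite quadratic form $Q$ is

$$Q = \int_{-\sigma}^{\sigma} \int_{-\sigma}^{\sigma} \frac{\sin \frac{T}{2}(u - v)}{\pi(u - v)} \, (1 + \mu_1 uv + \cdots + \mu_s u^s v^s) \, F(u) \bar{F}(v) \, du \, dv, \tag{148}$$

and an operator $K$ generating $Q$ is given by

$$KF = \int_{-\sigma}^{\sigma} \frac{\sin \frac{T}{2}(u - v)}{\pi(u - v)} \, (1 + \mu_1 uv + \cdots + \mu_s u^s v^s) \, F(v) \, dv, \quad |u| \le \sigma; \tag{149}$$

thus

$$Q = \int_{-\sigma}^{\sigma} \bar{F} \, KF \, du. \tag{150}$$

The equation defining the eigenvalues and eigenfunctions of $K$ is

$$K\Phi_k = \lambda_k \Phi_k, \quad k \ge 0, \tag{151}$$

in which the ordering $\lambda_0 \ge \lambda_1 \ge \lambda_2 \ge \cdots$ is used. It follows from the
Hilbert-Schmidt theory that the eigenvalues are denumerable and of
finite multiplicity and the eigenfunctions form an orthonormal set
which, from the positive definite character of $K$, is complete in $L^2(-\sigma, \sigma)$.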
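Let

$$\phi_k(t) = \frac{1}{(2\pi)^{1/2}} \int_{-\sigma}^{\sigma} e^{iut} \Phi_k(u) \, du; \tag{152}$$

then the Parseval relation for Fourier transforms shows that the
sequence $\phi_0(t), \phi_1(t), \phi_2(t), \dots$ is orthonormal over $(-\infty, \infty)$; further,
from equations (147), (150), and (151), one has

$$(\phi_j, \phi_k)_s = \int_{-\sigma}^{\sigma} \bar{\Phi}_k K\Phi_j \, du = \lambda_j \int_{-\sigma}^{\sigma} \Phi_j \bar{\Phi}_k \, du = 0, \quad j \ne k; \qquad = \lambda_k, \quad j = k. \tag{153}$$

Thus the sequence $\{\phi_k(t)\}_0^\infty$ forms an orthogonal system with respect to
the Sobolev inner product (146). The system $\{\phi_k(t)\}_0^\infty$ is also complete
in $W_{\sigma,s}^T$ as a consequence of the completeness of the system $\{\Phi_k(u)\}_0^\infty$ in
$L^2(-\sigma, \sigma)$.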
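For $s = 0$ the operator $K$ reduces to the familiar sine kernel $\sin \frac{T}{2}(u - v) / \pi(u - v)$ on $(-\sigma, \sigma)$, and its leading eigenvalue can be approximated with a simple quadrature. The sketch below is a rough illustration, not the author's method: it assumes that reading of the kernel, discretizes it by the midpoint rule on a hypothetical number of nodes, and applies power iteration. The values $\sigma = 2$, $T = 2$ are arbitrary.

```python
import math

def kernel(u, v, T):
    """Sine kernel sin(T (u - v) / 2) / (pi (u - v)), with its limit T / (2 pi) on the diagonal."""
    d = u - v
    return T / (2.0 * math.pi) if d == 0.0 else math.sin(0.5 * T * d) / (math.pi * d)

def largest_eigenvalue(sigma, T, n_nodes=120, n_iter=80):
    """Midpoint-rule discretization of K (case s = 0) followed by power iteration."""
    w = 2.0 * sigma / n_nodes
    nodes = [-sigma + (k + 0.5) * w for k in range(n_nodes)]
    K = [[kernel(u, v, T) * w for v in nodes] for u in nodes]
    x = [1.0 / math.sqrt(n_nodes)] * n_nodes       # unit-norm starting vector
    lam = 0.0
    for _ in range(n_iter):
        y = [sum(row[j] * x[j] for j in range(n_nodes)) for row in K]
        lam = math.sqrt(sum(c * c for c in y))     # norm of K x for unit-norm x
        x = [c / lam for c in y]
    return lam

lam0 = largest_eigenvalue(sigma=2.0, T=2.0)
B = 1.0
print(lam0, B * math.sqrt(lam0))
```

The eigenvalues of this kernel lie strictly between zero and one and sum to $\sigma T/\pi$; the printed value $B\lambda_0^{1/2}$ is the quantity that enters the width formula with $n = 0$.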

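Define the $n$-dimensional subspace $X_n \subset W_\sigma$ by

$$X_n = X_n(\phi_0, \dots, \phi_{n-1}); \tag{154}$$

then Theorem 13 provides the nth width of $W_{\sigma,s}^T(B)$, relative to $H_s^T$,
in terms of the eigenvalues of $K$.
Theorem 13: $d_n^{H_s^T}(W_{\sigma,s}^T(B)) = B\lambda_n^{1/2}$.
Proof: Let $f(t) \in W_{\sigma,s}^T(B)$; then

$$f(t) = \sum_{k=0}^{\infty} a_k \phi_k(t). \tag{155}$$

Let

$$g(t) = \sum_{k=0}^{n-1} a_k \phi_k(t) \in X_n; \tag{156}$$

then the orthogonality of the $\phi_k(t)$, (153), yields

$$\|f - g\|_s^2 = \sum_{k=n}^{\infty} |a_k|^2 \lambda_k. \tag{157}$$

Thus

$$\inf_{g \in X_n} \|f - g\|_s^2 \le \sum_{k=n}^{\infty} |a_k|^2 \lambda_k. \tag{158}$$

From the monotonicity of the $\lambda_k$, one has

$$\inf_{g \in X_n} \|f - g\|_s^2 \le \lambda_n \sum_{k=n}^{\infty} |a_k|^2; \tag{159}$$

however, the orthonormality of the $\phi_k(t)$ over $(-\infty, \infty)$ shows that

$$\int_{-\infty}^{\infty} |f(t)|^2 \, dt = \sum_{k=0}^{\infty} |a_k|^2 \le B^2 \tag{160}$$

and hence from equation (159)

$$E_{X_n}(W_{\sigma,s}^T(B)) = \sup_{f \in W_{\sigma,s}^T(B)} \, \inf_{g \in X_n} \|f - g\|_s \le B\lambda_n^{1/2}. \tag{161}$$

Thus

$$d_n(W_{\sigma,s}^T(B)) \le B\lambda_n^{1/2}. \tag{162}$$

Consider the ball $U_{n+1}$ defined by

$$g(t) = \sum_{k=0}^{n} a_k \phi_k(t), \quad \|g\|_s \le B\lambda_n^{1/2}; \tag{163}$$

then, by a theorem on balls in a finite dimensional subspace of a Banach
space,

$$d_n(U_{n+1}) = B\lambda_n^{1/2}. \tag{164}$$

Thus the theorem will be established if it is shown that the ball $U_{n+1}$
defined in equation (163) is contained in $W_{\sigma,s}^T(B)$. It is only necessary
to verify that

$$\int_{-\infty}^{\infty} |g(t)|^2 \, dt = \sum_{k=0}^{n} |a_k|^2 \le B^2. \tag{165}$$

One has from

$$\|g\|_s^2 = \sum_{k=0}^{n} |a_k|^2 \lambda_k \le B^2 \lambda_n, \tag{166}$$

that

$$\lambda_n \sum_{k=0}^{n} |a_k|^2 \le \sum_{k=0}^{n} |a_k|^2 \lambda_k \le B^2 \lambda_n, \tag{167}$$

and hence the theorem is proved.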

Use of the series representation of Theorem 4 permits one to estimate
the nth width of $W_{\sigma,0}^T(B)$. The quantities $m$ and $\delta_n$ are as in equation (48);
additionally, the corresponding interval $h_n$ is defined by $h_n = \pi(1 - \delta_n)/\sigma$.


Theorem 14: 








Hel qT 2 B e” 
ds” (WewlB) Se aT 
(cm + n(n —l1-— *)) 
Proof: Let 
Kd) = Ss 1,0, (168) 
and 
g(t) = Po {(jh) 6,(t) ; (169) 
then 
11) — 9D 1S Xe 14Gh) || 90) |. (170) 


Since, by Parseval’s relation of Theorem 3 


a J lf Pdt=h yy | {Gh) |? < B’, (171) 


Schwartz’s inequality applied to equation (170) yields 


B? 
1f — oD PST 2a alt. (172) 
One has, from equation (51) 


Wee eT bo) Bl ae N>=. (173) 


Oh, 
One may use Sonin's formula, equation (53), to effect the summation
in equation (173); thus











2 L " 
te ee a ea a 
(nam + n(w +5 ~ 55) [ral 5-3) 
(174) 
The choice 
_ |7 i_ 7 
ici E (v 5 al (175) 
leads to 
_ = 
I~ olle sy (176) 
(iem + ow +5 — a5) 
Thus equation (176) shows that 
2 3 —m 
atsi,0v2() = ® (2) ———"____, ar 
(em ae n(w Sere | 
and, hence, for n odd 
d"(Wz.(B)) S 23, lee ee (178) 
(em + nfo — 7) 
For n even, one has 
am" W?(B) s =F, a (179) 
rh 


(em Ar D(n = 1 a | 


thus equation (179) applies in all cases. The fractional guardband is 
now chosen as in equation (48), and the inequality of the theorem 
follows. 

Theorem 13 permits an immediate corollary to be obtained from 
Theorem 14. 


Corollary: Fors = 0, one has 


4 ao 


o @m + y(n — 1 — ; 


Int 
As was done in Theorem 7, polynomial approximation will be used
to estimate $d_n^{H_s^T}(W_{\sigma,s}^T(B))$. The estimate is given in Theorem 15.
Theorem 15:


(2e)""" 
nt (7")(en Si) Gn Pops: ny 





ats? (B) = Br(2) 


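Proof: The estimate will be obtained from equation (80). In order to
estimate $M_{n+s}$, consider

$$f(t) = \frac{1}{(2\pi)^{1/2}} \int_{-\sigma}^{\sigma} e^{iut} F(u) \, du, \tag{180}$$

from which one has

$$g(x) = \frac{1}{(2\pi)^{1/2}} \int_{-\sigma}^{\sigma} e^{iu(T/2)x} F(u) \, du. \tag{181}$$

Accordingly

$$g^{(r)}(x) = \left(\frac{T}{2}\right)^r \frac{1}{(2\pi)^{1/2}} \int_{-\sigma}^{\sigma} e^{iu(T/2)x} (iu)^r F(u) \, du. \tag{182}$$

By use of the Schwarz inequality, one obtains

$$|g^{(r)}(x)|^2 \le \left(\frac{T}{2}\right)^{2r} \frac{1}{2\pi} \int_{-\sigma}^{\sigma} u^{2r} \, du \int_{-\sigma}^{\sigma} |F(u)|^2 \, du. \tag{183}$$

The Parseval relation for Fourier transforms

$$\int_{-\infty}^{\infty} |f(t)|^2 \, dt = \int_{-\sigma}^{\sigma} |F(u)|^2 \, du \le B^2 \tag{184}$$

and equation (183) now yield

$$|g^{(r)}(x)|^2 \le \left(\frac{T}{2}\right)^{2r} \frac{B^2 \sigma^{2r+1}}{\pi(2r + 1)}. \tag{185}$$

Thus

$$M_{n+s} \le B \left(\frac{\sigma}{\pi}\right)^{1/2} \left(\frac{\sigma T}{2}\right)^{n+s} \frac{1}{(2n + 2s + 1)^{1/2}}. \tag{186}$$

The remainder of the analysis is the same as in Theorem 7.
Theorem 13 again permits an immediate corollary to be obtained
from Theorem 15.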
Corollary:


2% (2c)?"* 28 


Anes s a I’ 2 . 
T 2n 
nl 2) (2n + 1)(2n + 2s + 1) 


Theorems 14 and 15 lead to corresponding estimates of entropy 
through use of Theorem 1. 


Theorem 16: Let 


nz2ot(r+(*)J, Ox eet: OY zeal, 


m= [in (gama (*) )I 


and 





then 
HA (W;o(B)) 


s+ Pe (0+ Ginn) Tn Os 24) 


Proof: From Theorem 14, one has 

















@W2(B) = 2() ———*"_—__._ asi) 
is (om + D(n —2—- j )) 
From equation (48), one has 
In + 1 = 3, ' = (22 (in — 2»); (188) 
hes 
hence 
9 i 
d,1(We,o(B)) S 2, 3(*) e” ; Tete (189) 
TV3 T 1 26y"7 
(— a" - 4) 
Since 
1\2 
: ; <1 for nz2t(it (*)), (190) 


a) 
$d_{n-1}(W_{\sigma,0}^T(B))$ obeys the inequality


r PAO 
d,-(W29(B) S —25 (2) e™, (191) 


Tv 


According to Theorem 1, one may consider 


-2. (#)e" 2 — ae (192) 


n= [n(ggea@))I: ee 


The remaining analysis is the same as that of Theorem 8. The inequality 
of the theorem now follows. 


Theorem 17: LeO <a < 1,7 = TB(2c)'/(1 — adee(ac)’, 


n = max (4 ; e”) : {() real, 


and hence 


then 
H (W-,.(B)) 


2 
2 Ing +1 ~ In (2 ny) |] 


Sjstl+|—y yy. 7 ~«~*(WIf bee 
| in (2 mn) 41 +h | 


(2B! 42 *), 
QlE a 


Proof: The proof parallels that of Theorem 9. 
It may be useful to observe 


No S sa y. (194) 
A Satellite System for Avoiding Serial
Sun-Transit Outages and Eclipses 


By C. W. LUNDGREN 
(Manuscript received March 20, 1970) 


The motions of satellites phased in particular, slightly inclined orbits 
are timed so that different satellites are north and south of the equator 
when sun-caused outages occur in geostationary equatorial systems. 


I. INTRODUCTION 


Communication satellite systems experience predictable service 
interruptions involving the sun. A sun-transit outage occurs when the 
pointing angles from a receiving earth terminal to a satellite and to 
the sun so nearly coincide that the additional noise power presented 
by the sun renders transmission unusable.* When a satellite passes 
through the earth’s shadow, its solar primary power is interrupted and 
its sunlight-dependent heat balance is upset. 

A geostationary system serving a common coverage region may
include several satellites spaced less than 10° (175 mrad) in the
synchronous equatorial orbit. Figure 1 illustrates the timing of sun transits
and eclipses occurring in rapid series for three geostationary satellites 
during one day at the spring equinox, observed from an earth terminal 
located on the equator at longitude 0°W. One sun transit near noon 
and one eclipse 12 hours later are observed for each satellite served by 
this terminal. Eclipses of closely spaced satellites may occur at the same 
time, and sun transits of different satellites may also occur simultane- 
ously within a large coverage region. 

Daily sun transits of all geostationary satellites serving an earth 
terminal occur during one week in the spring and again in the fall. 
Service interruptions can last five minutes or more per satellite. Affected 
outage regions are large and move so rapidly that terrestrial restoration 
is unattractive. 

Conversely, a minimum of one working and one spare geostationary 
satellite are required for restoration independent of terrestrial facilities. 


Fig. 1. Sun transits for earth terminal on equator, and eclipses of geostationary
satellites at equinox. 


Such redundancy is also required for adequate protection against 
satellite failure, since satellite replenishment intervals are prohibitively 
large. 

A fully redundant geostationary system incorporates duplicate trans- 
missions to working and spare satellites and duplicate reception from 
these satellites continuously at all earth terminals. Partially redundant 
systems depend upon redirection of earth antenna beams to spare 
satellites.*

Rapid, highly coordinated switching between geostationary satellites 
is required at all earth terminals to restore serial sun-transit outages. 
Numerous residual transmission “hits” result from such switching. 
Also, the orbit spacing must be sufficiently large to prevent simul- 
taneous mutual outages of the different satellites at different locations 
within the coverage region to avoid additional switching complexity. 
A spacing as large as 8° (140 mrad) is necessary to prevent mutual 
sun-transit outages within the contiguous United States.†

Alternatively, serial sun transits are avoided by phasing the satel- 
lites in particular, slightly inclined orbits with motions timed so that 
one satellite is north of the equator and the other is south during both 
the spring and fall outage events. Only one switch of reception between 


* If the earth terminals are equipped with duplicate antennas, transmitters, and 
receivers, the capacity of both satellites can be utilized except during outage periods. 
† The 48 continental states, excluding Alaska and Hawaii.
the separated satellites is required per sun-transit season. The exact
timing (hour) is unimportant and may be different for the convenience 
of each earth terminal. Except for these two switches, all earth termi- 
nals throughout the coverage region are afforded uninterrupted recep- 
tion throughout the year. Mutual sun transits within the same coverage 
region are also avoided by this satellite diversity, and the large orbit 
spacing discussed above for geostationary satellites is unnecessary.* 


II. SUN TRANSITS AND ECLIPSES 


Sun transits and eclipses of geostationary satellites occur during the 
spring and fall seasons. The exact dates of the former depend primarily 
upon the latitude of the receiving earth terminal. 


2.1 Sun Transits of Geostationary Satellites 


The geometry and duration associated with a sun transit are controlled
by (i) the off-axis gain of a properly pointed earth antenna,
(ii) the receiving system noise temperature, (iii) the solar noise power
profile, and (iv) the minimum acceptable signal-to-noise ratio.

In Fig. 2 the sun’s rays are assumed to be parallel; refraction corrections
are neglected, assuming a sufficiently large angle of incidence
to the atmosphere for the desired ray SE. The affected outage region
is defined approximately by the locus of all points on the illuminated
earth’s surface for which earth antennas aimed at satellite S also point
a prescribed minimum angular distance a° away from the sun’s center.

Fig. 2—Planar approximation of geography affected by a sun-transit outage.

* See Sections 3.3 and A.5.

An estimate of the geography involved is provided by the elliptical 
intersection of a cone of angular radius a°, symmetrical about satellite-shadow
axis SP with its apex at S, and the horizontal plane at P. It is
elongated north-south in the figure. 

The sun is assumed to be a uniform disk source of thermal noise 
about 0.5° in diameter.* Shapes and magnitudes of the solar noise 
power profile vary strongly with time and radio frequency. Edge 
brightening at the lower microwave frequencies approaches a factor of 
two, and comparable variations of total flux with time are common.’” 

A minimum solar noise temperature for the mean quiet sun (total 
flux averaged over the disk) is about 25,000 K for a single polarization, 
inferred from measurements at a wavelength of 10.3 cm.?* This is
approximately the minimum temperature presented to a sun-pointed 
ideal antenna at 4 GHz whose beamwidth is less than 0.5°. 

Convolution of an appropriate solar noise profile with a known earth 
antenna gain pattern provides an estimate of increased noise versus 
angular displacement of the sun center from the main beam axis. 
Estimates for the minimum displacement permitting acceptable recep- 
tion at 4 GHz range from about 0.6° (10 mrad) for very large earth 
antennas (80 m) to greater than 1° (18 mrad) for small antennas 
(8 m).’"° Corresponding minor axes of outage regions range from 800
to 1300 km. Major axes occurring along satellite-earth longitudes are 
equal to the minor axes at the equator and approach 1.5 times the 
latter at high latitudes. 

Because of synchronism between earth rotation and satellite revolu- 
tion, each outage region appears to move. One at 41° north latitude 
traverses the contiguous United States from west to east in approxi- 
mately one-half hour at noon of the time zone at the satellite’s longitude 
(see Appendix A). 

Figure 3 illustrates the path of an outage region. Each path is tangent 
to the latitude intercept of the center of the satellite’s shadow at ap- 
parent noon at the satellite’s longitude. For all other longitudes in the 
Northern Hemisphere, the path lies slightly to the north of this latitude. 

Hence, in very late February or early March, short daily outages
affect earth terminals situated near the United States-Canadian border.
Two to three days later these terminals experience maximum outages
lasting five minutes or more, depending upon transmission parameters
and permissible signal-to-noise ratios. Outages at these terminals end
after an additional two to three days, the outage paths progressing
southward at a rate of about 3° latitude per day. All outages affecting
United States earth terminals above north latitude 26° cease prior to
mid-March.

Fig. 3—Approximate paths of sun-transit outages for geostationary satellite.

* The optical disk has a diameter of about 29 minutes of arc, in geocentered
angular measure.

Conversely, in the fall the daily outage paths progress from south 
to north, affecting southern United States terminals about October 1 
and ending in the north about mid-October. 

In Fig. 3, based on parameters adopted in Appendix A, a given earth 
terminal is affected about six days, twice yearly, while the contiguous 
United States experiences outages throughout a 14-day period, again 
twice yearly. If a multiple-feed antenna or a rapid-slewing antenna is 
employed to switch reception at an earth terminal from an affected 
satellite to another 6.8° (120 mrad) westward in the geostationary 
orbit, transmission from the latter satellite is interrupted only 30 
minutes later. 


2.2 Eclipses of Geostationary Satellites


Eclipses of geostationary satellites can be expected for a total of 
about 90 evenings per year in the spring and fall. Concurrent eclipses 
occur for geostationary satellites spaced less than 17.6° (310 mrad). 
Eclipses occur near apparent midnight of the time zone at each satellite’s
longitude, beginning in late February or early March and ending
by mid-April. Fall events begin about September 1 and end about mid- 
October. Eclipses lasting about 70 minutes occur on the dates of the 
spring and fall equinoxes; those lasting longer than one hour occur 
about 50 days per satellite per year. 
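
The eclipse figures quoted above can be roughly cross-checked with a short sketch. It assumes a cylindrical earth shadow (no penumbra, sun treated as a point source), which is a simplification not stated in the text, and it borrows the earth radius and orbit altitude used later in Appendix A. The function names are illustrative only.

```python
import math

# Values quoted in Appendix A of the text.
R_EARTH_KM = 6373.0      # mean spherical earth radius
H_GEO_KM = 35900.0       # geostationary orbit altitude
SOLAR_DAY_MIN = 24 * 60  # the shadow axis follows the sun, so one lap takes a solar day


def shadow_arc_deg(r_earth_km=R_EARTH_KM, h_km=H_GEO_KM):
    """Geocentric orbit arc inside a cylindrical earth shadow (assumed model)."""
    return 2.0 * math.degrees(math.asin(r_earth_km / (r_earth_km + h_km)))


def equinox_eclipse_minutes(r_earth_km=R_EARTH_KM, h_km=H_GEO_KM):
    """Time to cross the full shadow arc, as happens near the equinoxes."""
    return shadow_arc_deg(r_earth_km, h_km) / 360.0 * SOLAR_DAY_MIN


if __name__ == "__main__":
    print(f"shadow arc: {shadow_arc_deg():.1f} deg")
    print(f"equinox eclipse: {equinox_eclipse_minutes():.0f} min")
```

Under these assumptions the shadow arc comes out a little above 17° and the equinox eclipse a little under 70 minutes, which is in line with the 17.6° spacing and 70-minute duration quoted in the text; the remaining difference is presumably due to effects the sketch ignores.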

Communication satellites are provided with batteries to prevent 
circuit outages and to maintain antenna pointing, attitude control, 
station keeping, telemetry, and command capabilities during eclipses. 
However, concomitant voltage and temperature fluctuations, loss of 
the solar reference for antenna pointing, and related ground command 
activities may contribute to an increased likelihood of satellite failure
or a reduction in transmission capacity during eclipses. 


III. DIVERSITY SYNCHRONOUS SATELLITE SYSTEM 


A minimum arrangement of two slightly inclined, circular synchro- 
nous orbits with deliberate phasing of one working and one spare 
satellite in their respective orbits is suggested for providing space 
diversity during outage periods. The specific orbit parameters and 
satellite phasing are chosen so that they may remain unchanged 
throughout the year. Thus satellite station-keeping fuel expenditures 
are comparable to geostationary values. The parameters are also chosen 
so that only one noncritical handover of reception between satellites is 
required per sun-transit season. 


3.1 Basic Satellite Phasing in Specific Inclined Orbits 


Figure 4 illustrates the relationship between a ‘‘figure 8” pattern 
traced out by a synchronous satellite and the magnitude of its orbit 
inclination. Recent descriptions of such patterns are given by Rowe 
and Penzias,° treating the efficient use of orbit longitude. 

Figure 5 illustrates the satellite phasing and timing of motions
required for a two-satellite diversity system. The time reference selected
for describing these motions is initial time $t_0$ mean solar hours, marking
the advent of 12 o’clock noon (apparent, or sun time) on the date of
the vernal equinox at average $\theta$ of mean longitudes $\theta_1$ and $\theta_2$ degrees
west for satellites $S_1$ and $S_2$, respectively [$\theta = (\theta_1 + \theta_2)/2$]. For satellites
sharing radio frequency bands, a minimum orbit spacing between
interfering satellites is generally specified consistent with resolving
powers of the earth antennas. Accordingly, a minimum satellite spacing
$x$ degrees is assumed between mean longitudes $\theta_1$ and $\theta_2$.
Fig. 4—Earth synchronous orbits and figure 8 patterns.


In terms of $\theta$, chosen for service to a particular geographical region,
the mean orbit longitudes shown in Fig. 5 are


$$\theta_1 = \theta - x/2, \qquad \theta_2 = \theta + x/2 \quad \text{degrees west}. \tag{1}$$


Dimensions of 8 patterns allowing adequate diversity between 
properly phased satellites are determined in Appendix B. Peak satellite 
displacements from the equatorial plane (geostationary orbit in Fig. 5) 
coincide in the spring and fall with sun transits of each satellite’s mean 
longitude meridian. 

For example, in Fig. 5(a) satellite $S_1$ is northernmost in its 8 pattern
prior to apparent noon at average longitude $\theta$. To an observer located
at earth longitude $\theta_1$, this coincides with alignment of the sun behind
the 8 pattern for satellite $S_1$.

At apparent noon at longitude $\theta$, satellite $S_1$ in Fig. 5(b) moves
very slowly toward the geostationary orbit, while $S_2$ is approaching
the southernmost point in its 8 pattern. The sun is located midway
between the 8 patterns.

Shortly after apparent noon at longitude $\theta$, the sun aligns behind the
8 pattern for satellite $S_2$, as observed from earth longitude $\theta_2$. At this
time, satellite $S_2$ reaches its peak excursion, while satellite $S_1$ moves
more rapidly towards the geostationary orbit [Fig. 5(c)]. Tick marks
on the 8 patterns are labeled according to each satellite’s location at
times referred to longitude $\theta$.

Fig. 5—Phased satellite motions.

Paths of the sun on consecutive days during the spring sun-transit 
season are also indicated. Note that these daily paths progress from 
south to north in accordance with a decreasing southern declination of 
the sun’s rays at this time of year (cf., Fig. 3). 
Circles of radius a° centered at each satellite define the minimum
pointing angle to the sun for earth antennas directed at the satellites.
Hence, reception from satellite $S_1$ is interrupted when the sun is within
the circle for $S_1$. Tick marks give positions of the sun along its path,
again at times referred to longitude $\theta$.

In Fig. 5, uninterrupted reception from satellite $S_1$ is assumed
throughout the late fall and winter, until March 7. At any convenient
time between March 1 and March 7, an earth terminal observing these
motions redirects its reception from satellite $S_1$ to satellite $S_2$. This
allows uninterrupted reception from $S_2$ until the fall sun-transit season,
during which this noncritical procedure is reversed.

Note that the 8 patterns in Fig. 5 are larger than required by a 
single earth terminal. The dimensions determined in Appendix B are 
sufficient to prevent serial sun transits throughout the entire latitude 
range of the coverage region, so that only one outage region from 
either satellite may traverse any part of the coverage region on any 
day. This simplifies switching between satellites in restoration schemes 
involving large numbers of working satellites and a minimum of one 
spare satellite.* However, for the basic scheme involving duplicate 
transmission via equal numbers of working and spare satellites, the 
dimensions of the 8 patterns may be reduced until the outage circles 
(a°) are almost tangent to the geostationary orbit. Redirection of the 
earth antenna appropriate for Fig. 5 is required on or about March 4 
for such reduced 8 patterns. 

Note also that the satellites spend most of the time near the extremes 
of the 8 patterns, providing near-maximum diversity separation for 
several hours near noon. This tolerance to timing errors is particularly 
useful since the apparent alignments of the sun in Fig. 5 and the timing 
of transit events are somewhat different for observers at different loca- 
tions within the coverage region. Allowances are made in Appendix B 
in the computation of required diversity separation for both latitude 
and longitude ranges of the coverage region, assuming that uninter- 
rupted reception from the unaffected satellite is required continuously 
at all earth terminals throughout the coverage region. 

The diversity performance is made nearly independent of arbitrary 
satellite spacing x by phasing each satellite so that its maximum lati- 
tude excursion occurs at sun transit of its mean longitude meridian. 

Tick marks in Fig. 6 illustrate a daily progression of satellite positions
at apparent noon at longitude $\theta$ throughout the year. This regular
shift is observed in the ideal case at the earth terminals because such
inclined synchronous orbits tend to maintain fixed orientations in
space as the earth revolves around the sun (about 1° per day), illustrated
in Fig. 7, by virtue of conservation of orbit angular momenta
$m_1$ and $m_2$. Orbit perturbations, or departures from the above ideal
motions, are approximately the same as those for geostationary orbits
and are corrected by firing small station-keeping rocket motors at
intervals throughout the lifetime of the satellites.

Fig. 6—Shift of daily satellite positions.

* Discussed in Section 3.3.

Specification of orbit stabilization with respect to the fixed stars is 
necessary to obtain properly timed satellite diversity automatically 
throughout the year; the precision required for diversity is needed only 
during outage seasons. 

Hence, the daily period of satellite motions in their figure 8 patterns
is less than 24 hours of civil time (mean solar hours). The actual sidereal
period is $23^{\mathrm h}\,56^{\mathrm m}\,04^{\mathrm s}.09054$ in mean solar time measure.

The daily shift of positions is utilized, by the deliberate orbit orien- 
tations and satellite phasing in the orbits, so that the apparent positions 
of satellites $S_1$ and $S_2$ are reversed automatically in time for diversity
reception again during the fall outage season (see Fig. 7). Positions are 
also reversed daily, providing diversity for satellite eclipses near mid- 
night, assuming sufficiently large orbit inclinations. 
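
The size of the daily shift, and the time it takes to accumulate a full reversal of positions, follow directly from the sidereal period quoted above. The sketch below works this out; the function names are illustrative and not part of the original analysis.

```python
SOLAR_DAY_S = 24 * 3600.0
SIDEREAL_DAY_S = 23 * 3600.0 + 56 * 60.0 + 4.09054  # period quoted in the text


def daily_advance_seconds():
    """Seconds by which the figure 8 motion runs ahead of civil time each day."""
    return SOLAR_DAY_S - SIDEREAL_DAY_S


def daily_shift_degrees():
    """Phase shift of the satellite along its figure 8 cycle per civil day."""
    return 360.0 * daily_advance_seconds() / SIDEREAL_DAY_S


def days_to_reverse_positions():
    """Days needed to accumulate a half-cycle (180 degree) shift."""
    return 180.0 / daily_shift_degrees()


if __name__ == "__main__":
    print(f"advance per day: {daily_advance_seconds():.1f} s")
    print(f"shift per day:   {daily_shift_degrees():.3f} deg")
    print(f"half cycle:      {days_to_reverse_positions():.1f} days")
```

The shift is just under 1° per day, so a half-cycle accumulates in roughly half a year, which is why the positions of the two satellites at noon are interchanged between the spring and fall outage seasons.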

Conversely, lesser accumulated shifts must also be considered in
computing the minimum diversity separation for sun transits for
coverage regions located far from the equator, since sun transits occur
either before or after actual equinoxes (see Figs. 5 and 6, and Appendix B).
Sun transits are observed in the Northern Hemisphere prior
to the vernal equinox and again after the autumnal equinox. Offsets of
approximately two weeks from the symmetrical case are representative
for the contiguous United States. Of course, dates of satellite eclipses
are independent of earth latitude and the ideal symmetry is applicable.

Fig. 7—Satellite space diversity with respect to the sun and seasons.


3.2 Orbit Parameters 


Satellite motions and initial conditions are illustrated in Figs. 7 and 
8 for two diversity satellites. 


3.2.1 Inclination of Orbits 


The planes of orbits for satellites $S_1$ and $S_2$ are tilted slightly with
respect to the earth’s equatorial plane by inclination angles $i_1$ and $i_2$.

For the idealized case of equal inclinations, the minimum required
magnitudes range from about 2 degrees for avoiding serial sun transits
to about 9 degrees for avoiding serial and concurrent eclipses (see
Appendix B).


3.2.2 Alignment of Inclined Orbit Planes 


Positioning of the figure 8s is accomplished by aligning the orbit
planes in slightly offset opposition as shown in Fig. 8. Two plane intersections
with the earth’s equator result, each forming acute angles
$(90 - x/2)$ degrees symmetrically with the mean equinox axis (intersection
of planes of the equator and of the earth’s orbit around the
sun; direction from earth towards the sun at the vernal equinox).


Fig. 8—Synchronous orbits phased for sun diversity.
3.2.3 Phasing of Satellite $S_1$


The time of the ascending node in orbit 1 for satellite $S_1$, for spacing
$x$ degrees, is

$$t_1 = t_0 - (6 + x/30) \quad \text{mean solar hours}, \tag{2}$$


so that at time $t = t_0 - x/30$ hours satellite $S_1$ necessarily assumes its
maximum north latitude (upper limit of excursion for left-hand figure 8
pattern in Fig. 5). From Fig. 8, note that the semi-major axis of the 8 in
geocentered angular measure is equivalent numerically to orbit inclination $i_1$.


3.2.4 Phasing of Satellite $S_2$


The descending node in orbit 2 for satellite $S_2$ is specified by

$$t_2 = t_0 - (6 - x/30) \quad \text{mean solar hours}, \tag{3}$$


for which satellite $S_2$ assumes its maximum south latitude at time
$t = t_0 + x/30$ hours.
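
The phasing rules of Sections 3.1, 3.2.3, and 3.2.4 reduce to a few lines of arithmetic. The sketch below evaluates the mean longitudes and the node and peak times for a given reference time and spacing. The function names and the dictionary layout are illustrative assumptions, and the quarter period is taken as 6 hours, as in the expressions above.

```python
def mean_longitudes(theta_deg, x_deg):
    """Mean west longitudes (theta_1, theta_2) of the pair for average theta and spacing x."""
    return theta_deg - x_deg / 2.0, theta_deg + x_deg / 2.0


def phasing_times(t0_hours, x_deg):
    """Node and peak-excursion times in mean solar hours, for spacing x in degrees."""
    offset = x_deg / 30.0  # x/2 degrees of earth rotation at 15 degrees per hour
    return {
        "s1_ascending_node": t0_hours - (6.0 + offset),   # time t_1
        "s1_peak_north": t0_hours - offset,
        "s2_descending_node": t0_hours - (6.0 - offset),  # time t_2
        "s2_peak_south": t0_hours + offset,
    }


if __name__ == "__main__":
    print(mean_longitudes(95.0, 6.0))
    for name, hours in phasing_times(12.0, 6.0).items():
        print(f"{name}: {hours:.2f} h")
```

For a spacing of 6° the two peaks straddle the reference noon by 0.2 hour (12 minutes) on either side, so each satellite is at its extreme latitude when the sun crosses its own mean longitude meridian.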


3.2.5 Satellite Motions Related to the Sun and Seasons 


By synchronizing satellite motions and timing with respect to the 
earth’s revolution about the sun as shown in Fig. 7, the required space 
diversity is obtained during both spring and fall outage seasons. 

Satellite motions and timing are specified above in terms of initial 
conditions at the vernal equinox. Of course, actual satellite launching 
is not restricted to any season, provided that satellite motions coincide 
with those for the specified system at the times when sun-caused 
outages occur in geostationary equatorial systems. 


3.3 Phased Multisatellite Systems 


Two satellites are required for the basic diversity system. The 
diversity satellites may be placed as desired in orbit longitude consistent
with an assumed minimum orbit spacing $x$.

An obvious system growth is to add uniformly spaced, alternately 
phased working and spare satellites along the orbit (Fig. 5). Note that 
one of a diversity pair of spare satellites can restore all working satel- 
lites if fast switching may be employed daily at the affected earth 
terminals. Reception is transferred in sequence between transitted 
active satellites and the unaffected spare.* The orbit spacing between 
second-adjacent satellites (same phasing) should be sufficient to prevent 
mutual sun transits of the latter satellites within the coverage region. 


* The affected spare is available as an additional working satellite. 
For this case, only half of the orbit spacing required by geostationary
satellites is required by the diversity satellites (about 4° for avoiding 
mutual outages within the contiguous United States). 

Conversely, for satellites which may be closely spaced ($x$ = 1°),
efficient use of the orbit may result from judicious incorporation, in a 
manner consistent with the satellite phasing and timing described 
above, of orbit loading techniques suggested by Rowe and Penzias.° 
Deliberate relative phases in adjacent 8 patterns may prevent major 
multiple sun transits of all satellites near the same latitude and mini- 
mize daily switching to different unaffected satellites (Section 3.1). 

For large orbit inclinations,* (i) tracking of satellite and earth
antennas is required, (ii) reduction in latitude of the coverage region
results, (iii) transmission at low angles of arrival is more susceptible
to atmospheric degradation, and (iv) the interference exposure between
radio relay and satellite services is increased.


3.4 Antenna Requirements for Earth Terminals and Satellites 


Only slight geometric departures from the geostationary case are 
required to obtain diversity for avoiding sun-transit outages; some- 
what larger departures are required for avoiding eclipses. Hence, 
satellite radio transmission parameters appropriate for corresponding 
geostationary designs are essentially retained. 

Earth antennas need follow only slow and very small periodic satellite
motions. These motions are accommodated reliably by conventional
24-hour cyclic cam drives (sidereal time measure). Costs and mainte- 
nance for such antenna drives are virtually insignificant when compared 
with those for full automatic tracking. Cyclic drives are appropriate 
for a large deployment of small earth antennas requiring moderate 
beam-pointing precision, while costs for full automatic tracking are less 
significant for a smaller number of large antennas requiring precise 
beam pointing. 

A minimum earth antenna steering requirement accommodating 
orbit inclinations up to 10° (175 mrad) and satellite longitude drifts 
from assigned orbit stations of ±10°, for satellite elevations of 5° or
more, is reported by the Communications Satellite Corporation for 
quasi-stationary satellites.’ Such earth terminals are compatible with 
the diversity satellites, since in the ideal application the smaller de- 
sired orbit inclinations are also maintained continuously. 

* For $x$ = 1°, $i$ = 10.7° and for $x$ = 5°, $i$ = 24°, from equation (7) of Ref. 6.


The spin axis of a satellite is maintained perpendicular to its orbit
plane, in the simplest wheel-mode attitude stabilization. Satellite
antenna pointing referred to this axis benefits from partial compensation
of pointing errors otherwise accompanying departures from the
equatorial plane in the inclined orbits.*


IV. CONCLUSIONS 


Space diversity is provided automatically at times of sun transits 
and eclipses by a convenient modification of a geostationary system 
in which the satellites appear to move in figure 8 patterns. Alternate 
satellites are oppositely phased, so that when one satellite is north the 
other is south. Orbit orientations and timing of satellite motions are 
arranged so that near the spring and fall equinoxes, when geostationary 
satellites transit the sun, the diversity satellites are at extreme north 
and south positions, allowing uninterrupted reception from at least 
one satellite. 

The contiguous United States is cleared of serial sun-transit outages 
if orbit inclinations of about two degrees are employed. Concurrent 
satellite eclipses are also reduced in frequency and duration, and are 
avoided by increasing the orbit inclinations to about nine degrees. 

Neglecting perturbations common to synchronous orbits including 
the geostationary orbit, the satellite deployment is steady state. 
Satellite launching requirements, mean station-keeping precision, and 
lifetimes are comparable to the geostationary case. 

Diversity is provided automatically during both spring and fall 
outage seasons, requiring two noncritical switches between satellites 
per year. 

Relatively minor modifications of earth terminals and satellites
designed for geostationary service are required. 

The diversity satellites are positioned as desired in orbit longitude 
without degrading system performance significantly, consistent with 
minimum orbit spacings to control interference from neighboring 
satellites. 

Transmission via the unaffected satellite of a diversity pair can be 
switched in sequence daily to restore all transitted active satellites of 
a larger system. 

One-half the minimum orbit spacing required by geostationary 
systems to prevent mutual outages of neighboring satellites within 
large coverage areas is required by the diversity system, since only 
alternate satellites experience outages on a given day. 

Sun-transit outages in satellite circuits can be restored without 
involving terrestrial facilities. 


* For $i$ = 2°, a peak uncompensated pointing error of 0.3° is representative.

APPENDIX A
Simplified Geometry and Numerical Examples for Geostationary Satellites 


A.1 Minimum-Latitude Circle Tangent to Outage Path


At the satellite longitude, conjunction of the sun and satellite occurs 
at apparent noon and the satellite’s shadow intercepts a minimum 
latitude, shown in Fig. 9. On successive days in the spring, the shadow 
path becomes tangent to a smaller north latitude at the satellite’s 
longitude, and lies slightly to the north of this latitude for all other 
longitudes. 

Figure 9 illustrates the sun’s rays on March 4, 1970. From an almanac,
the apparent declination of the sun for 0 hours ephemeris time
(E.T.) is −6° 40′ 54″.5 and on March 5 is −6° 17′ 49″.0.

Ephemeris transit of the sun on March 4 is given as $12^{\mathrm h}\,11^{\mathrm m}\,50^{\mathrm s}.39$
and the reduction $\Delta T$ from universal time (U.T.) to E.T. for the year
1970.5 is approximately $40^{\mathrm s}$. The ephemeris time corresponding to solar
transit at west longitude $\lambda$° is

$$\mathrm{E.T.} = \mathrm{E.T.\,(TRANSIT)} + [1.002738]\,\frac{\lambda}{360}\,(24^{\mathrm h}) \quad \text{hours}, \qquad \lambda < 180^\circ, \tag{4}$$

where the coefficient in brackets is the approximate ratio of the mean
solar day to the mean sidereal day. Allowing for a 6-hour time difference
from the Greenwich Meridian to the Central Time Zone,

$$\mathrm{C.S.T.} = \mathrm{E.T.} - \Delta T - 6 \quad \text{hours}. \tag{5}$$

Assume a transit of geostationary satellite stationed at $\lambda$ = 95°W:

$$\mathrm{C.S.T.} = 12^{\mathrm h}\,11^{\mathrm m}.84 + 6^{\mathrm h}\,20^{\mathrm m}.93 - 0^{\mathrm h}\,0^{\mathrm m}.67 - 6^{\mathrm h}\,0^{\mathrm m}. \tag{6}$$

From equation (4), the ephemeris time of this event is $18^{\mathrm h}\,32^{\mathrm m}.77$ on
March 4. Interpolating between $0^{\mathrm h}$ on March 4 and $0^{\mathrm h}$ on March 5, the
sun’s apparent declination is

$$-6.682^\circ + \frac{18.55}{24.00}\,(0.385^\circ), \tag{7}$$
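
The interpolation in equation (7) can be checked numerically from the almanac values quoted above. The helper names in this sketch are illustrative; the inputs are the two tabulated declinations and the ephemeris time of the transit.

```python
def dms_to_deg(sign, degrees, minutes, seconds):
    """Convert degrees, minutes, seconds of arc to signed decimal degrees."""
    return sign * (degrees + minutes / 60.0 + seconds / 3600.0)


def interpolate_declination(dec_start_deg, dec_end_deg, hours_after_start):
    """Linear interpolation between two declinations tabulated 24 hours apart."""
    return dec_start_deg + (hours_after_start / 24.0) * (dec_end_deg - dec_start_deg)


DEC_MARCH_4 = dms_to_deg(-1, 6, 40, 54.5)  # 0 hours E.T., March 4, 1970
DEC_MARCH_5 = dms_to_deg(-1, 6, 17, 49.0)  # 0 hours E.T., March 5, 1970
ET_OF_TRANSIT_H = 18 + 32.77 / 60.0        # ephemeris time of the event

D_DEG = interpolate_declination(DEC_MARCH_4, DEC_MARCH_5, ET_OF_TRANSIT_H)

if __name__ == "__main__":
    print(f"declination at transit: {D_DEG:.3f} deg")
```

The result is a southern declination of a little under 6.4°, which is the value of D carried into the shadow geometry that follows.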
Fig. 9—Determination of minimum outage path latitude $\varphi_{\min}$.

Note from Fig. 9,

$$\sin\varphi_{\min} = \frac{S}{R}\,\sin D, \tag{8}$$

$$S = \frac{h + R(1 - \cos\varphi_{\min})}{\cos D} \quad \text{km}, \tag{9}$$

where $\varphi_{\min}$ is the north latitude of the satellite’s shadow at the time of
sun transit.
Then

$$\sin\varphi_{\min} = \frac{\sin D}{\cos D}\left[(1 - \cos\varphi_{\min}) + \frac{h}{R}\right], \tag{10}$$

from which it is determined that $\varphi_{\min}$ = 41.0° north latitude, assuming
geostationary orbit altitude $h$ = 35,900 km and mean spherical earth
radius $R$ = 6373 km.
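
Equation (10) is implicit in the latitude, so it has to be solved numerically. One simple way, sketched below, is fixed-point iteration; this is an illustrative choice and not necessarily how the original value was obtained. The sketch uses the magnitude of the declination, and the function name is hypothetical.

```python
import math


def min_latitude_deg(decl_deg, h_km=35900.0, r_km=6373.0, tol=1e-12, max_iter=200):
    """Solve equation (10) for the shadow latitude by fixed-point iteration.

    decl_deg is the sun's declination; only its magnitude is used here.
    """
    tan_d = math.tan(math.radians(abs(decl_deg)))
    phi = 0.0  # start at the equator
    for _ in range(max_iter):
        nxt = math.asin(tan_d * ((1.0 - math.cos(phi)) + h_km / r_km))
        if abs(nxt - phi) < tol:
            return math.degrees(nxt)
        phi = nxt
    raise RuntimeError("fixed-point iteration did not converge")


if __name__ == "__main__":
    print(f"minimum latitude: {min_latitude_deg(-6.384):.1f} deg north")
```

With the declination found above, the iteration settles within a few steps at a latitude close to 41° north. For much larger declinations the argument of the arcsine exceeds unity, which corresponds to the shadow missing the earth altogether.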


A.2 Estimate of Speed with Which Outage Centers Traverse U.S.A. 


Figure 10 shows the contiguous United States represented by a
longitude span of 60° centered at the satellite longitude and located at
north latitude $\varphi_{\min}$ (41°). Consider a projection of the extreme longitude
meridians (i.e., 30° referred to the satellite longitude) parallel to an
assumed shadow axis between the span center at B’ and the satellite
at B, such that orbit arc intercept AC is specified.

The geocentric orbit radius is

$$AO = CO = R + h = 42{,}270 \text{ km}. \tag{11}$$

Then the radius of latitude circle $\varphi_{\min}$ is

$$R'' = R\cos\varphi_{\min} = 4810 \text{ km}. \tag{12}$$

The approximate distance measured along latitude circle $\varphi_{\min}$ for this
model of the United States is

$$|\widehat{A'C'}| = \frac{2\pi\,(60^\circ)}{360^\circ}\,R'' = 5040 \text{ km}. \tag{13}$$

Recognizing equilateral triangle A’OC’, the orbit chord is

$$AC = A'C' = R'' = 4810 \text{ km}. \tag{14}$$

The solution of an oblique triangle with sides a, b, c and opposite
angles A, B, C is

$$\cos A = \frac{-a^2 + b^2 + c^2}{2bc}, \tag{15}$$

so that the orbit arc may be found from

$$\cos(A\text{-}O\text{-}C) = \frac{-(AC)^2 + 2(R + h)^2}{2(R + h)^2}. \tag{16}$$

Then the desired geocentered angle representing the orbit intercept of
all parallel sun’s rays simultaneously illuminating this model is

$$\widehat{AC} = \cos^{-1}\left[\cos(A\text{-}O\text{-}C)\right] = 6.5 \text{ degrees}. \tag{17}$$

Fig. 10—Geometry describing motion of an outage region across a model of the
contiguous United States.


The time required for a satellite’s shadow to traverse a stationary
representation of the United States is numerically equivalent to the
time for the fractional revolution of a satellite from position A to C:

$$t_{AC} = 6.5^\circ \times \frac{(24 \times 60)^{\mathrm m}}{360^\circ} = 26.0 \text{ min}. \tag{18}$$

However, the actual elapsed time $t_e$ is greater by virtue of earth rotation
during this interval. The effective longitude span of the United
States is very nearly

$$60^\circ + 26^{\mathrm m}.0 \times \frac{15^\circ}{60^{\mathrm m}} = 66.5^\circ. \tag{19}$$

Accounting for a correspondingly enlarged orbit intercept,

$$t_e = t_{AC} \times \frac{66.5^\circ}{60^\circ} = 28.8 \text{ min}, \tag{20}$$

so that an outage region traverses the United States from west to east
in approximately one-half hour. The exact interval depends primarily
upon $\varphi_{\min}$.
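
The chain of estimates in this section, from the latitude-circle radius through the traversal time, can be reproduced in a few lines. The sketch below follows the same steps without intermediate rounding, so its results differ slightly from the rounded figures in the text. The function name and the returned dictionary keys are illustrative assumptions.

```python
import math

R_KM = 6373.0   # mean spherical earth radius
H_KM = 35900.0  # geostationary orbit altitude


def traversal_estimate(phi_min_deg=41.0, span_deg=60.0):
    """Time for an outage center to cross a longitude span at latitude phi_min."""
    r_lat = R_KM * math.cos(math.radians(phi_min_deg))       # latitude-circle radius
    arc_km = math.radians(span_deg) * r_lat                  # distance along the circle
    chord_km = 2.0 * r_lat * math.sin(math.radians(span_deg) / 2.0)
    orbit_r = R_KM + H_KM                                    # geocentric orbit radius
    cos_aoc = (2.0 * orbit_r**2 - chord_km**2) / (2.0 * orbit_r**2)
    aoc_deg = math.degrees(math.acos(cos_aoc))               # orbit intercept angle
    t_ac_min = aoc_deg * (24 * 60) / 360.0                   # stationary-earth time
    eff_span_deg = span_deg + t_ac_min * 15.0 / 60.0         # add earth rotation
    t_e_min = t_ac_min * eff_span_deg / span_deg             # elapsed time
    return {
        "r_lat_km": r_lat, "arc_km": arc_km, "chord_km": chord_km,
        "aoc_deg": aoc_deg, "t_ac_min": t_ac_min,
        "eff_span_deg": eff_span_deg, "t_e_min": t_e_min,
    }


if __name__ == "__main__":
    for key, value in traversal_estimate().items():
        print(f"{key}: {value:.1f}")
```

For a 60° span the chord equals the latitude-circle radius, as noted in the text, and the elapsed time comes out near 29 minutes, consistent with the half-hour figure.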


A.3 Estimation of Size of Outage Region—Example 


A conic figure of revolution about axis SP in Fig. 2 defining the 
affected outage region subtends total angle 2a measured at the satellite. 
To enable example calculations without specific reference to antenna 
pattern data, a worst-case minimum angular separation a = 1° be- 
tween a satellite and the sun center is adopted.* 


*The value a = 1° is assumed for a hypothetical 4-GHz satellite system incor- 
porating 55 percent efficient, 30-ft diameter parabolic reflector earth antennas, a 
receiving system noise temperature of 200 K, and a 3-dB allowable increase in 
received thermal noise power. 
The horizontal plane at the location of the satellite’s shadow P at
time of transit is shown in edge view in Fig. 11. Slant range S = SP 
is found from equation (9) to be about 37,470 km. The conic section 
defined by this plane and the outage cone is elliptical; point P specifies 
its motions. 

The east-west semi-minor axis r is equivalent to the radius of the
right circular intersection of the cone and a plane through P normal to
satellite-shadow axis SP:

$$r = S \sin a \cong 655 \text{ km}. \tag{21}$$

The north-south semi-major axis r’ in Fig. 11 is found from a projection
of the above circular intersection upon the local horizontal
plane at P:

$$r' = \frac{r}{\cos(\varphi_{\min} - D)} \cong 970 \text{ km}. \tag{22}$$
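
The two semi-axes of the outage ellipse can be evaluated directly from the quantities already quoted. The sketch below takes the slant range of about 37,470 km as given in the text, uses a signed (southern, hence negative) declination so that the incidence angle is the latitude minus the declination, and uses hypothetical function and parameter names.

```python
import math


def outage_semi_axes_km(slant_km=37470.0, a_deg=1.0, phi_min_deg=41.0, decl_deg=-6.384):
    """Return (east-west semi-minor, north-south semi-major) axes in km."""
    r_minor = slant_km * math.sin(math.radians(a_deg))
    # Angle between the sun's rays and the local vertical at the shadow point.
    incidence_deg = phi_min_deg - decl_deg
    r_major = r_minor / math.cos(math.radians(incidence_deg))
    return r_minor, r_major


if __name__ == "__main__":
    r_minor, r_major = outage_semi_axes_km()
    print(f"semi-minor: {r_minor:.0f} km, semi-major: {r_major:.0f} km")
```

The results agree with the rounded values of about 655 km and 970 km to within a few kilometers; the full east-west width of the region is therefore about 1310 km.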


Fig. 11—Determination of outage region at P.


A.4 Estimate of Outage Duration


The maximum duration of an outage occurring at an earth terminal
located on latitude $\varphi_{\min}$ is approximately that fraction of time $t_e$ [equation (20)]
for the satellite’s shadow to travel the 1310-km width of the
above outage pattern. Allowing for earth rotation, and noting that the
60° longitude span $\widehat{A'C'}$ at latitude $\varphi_{\min}$ corresponds to chord A’C’:

$$\frac{2r}{|\widehat{A'C'}|}\,t_e < t_d < \frac{2r}{|A'C'|}\,t_e, \tag{23}$$

$$7.5 \text{ minutes} < t_d < 7.8 \text{ minutes}.$$
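
The bounds on the outage duration follow from scaling the traversal time by the ratio of the outage width to the arc and chord lengths. A minimal sketch, using the rounded values from the text as defaults and an illustrative function name:

```python
def outage_duration_bounds_min(t_e_min=28.8, width_km=1310.0,
                               arc_km=5040.0, chord_km=4810.0):
    """Lower and upper estimates of the maximum outage duration, in minutes."""
    lower = t_e_min * width_km / arc_km    # distance measured along the latitude circle
    upper = t_e_min * width_km / chord_km  # distance measured along the chord
    return lower, upper


if __name__ == "__main__":
    lower, upper = outage_duration_bounds_min()
    print(f"{lower:.1f} min < outage duration < {upper:.1f} min")
```

Because the arc is longer than the chord, the arc-based fraction gives the shorter estimate and the chord-based fraction the longer one.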


For the satellite stationed at longitude 95° west, the path of its 
shadow on March 4, 1970, approaches latitude 41° north near Omaha 
at 12:32 p.m. C.S.T. Taking dimensions of the outage region into 
account, the West Coast should just begin to experience outages north 
of Eureka, California, at about 10:16 a.m. Pacific Standard Time, and 
the last outages, near Boston, should cease about 1:50 p.m. Eastern 
Standard Time. 


A.5 Geostationary Satellite Spacing and Serial Outages 


Several identical satellites are assumed deployed along the geo- 
stationary orbit. Earth terminals are assumed capable of receiving 
signals from at least one pair of adjacent satellites, either simultane- 
ously or one at a time. The orbit spacing between satellites is assumed 
to be uniform, but adjustable to alter the timing of serial sun transits. 
Numerical assumptions made in previous sections are retained for 
illustration; earth terminals are assumed to be located along the outage 
path (worst case). 


A.5.1 Case 1-Minimum of 30 Minutes Between Switches at an Earth
Terminal


If each satellite is assumed to possess spare circuit capacity adequate 
for the restoration of one transitted satellite, it is of interest to esti- 
mate the orbit spacing between satellites required for a prescribed 
outage-free interval between switches at an affected earth terminal. 
The interval between onsets of serial outages at a given earth terminal 
for satellites spaced 6.5° in orbit, allowing for earth rotation is about 
28.8 minutes (Section A.2). Then, an approximate minimum satellite 
spacing for a 80-minute clear interval is (80"/28™.8) X 6.5° = 6.8°. 


A.5.2 Case 2-Minimum of 30 Minutes Between Adjacent Outages


If multiple satellites are deployed without spare capacity and an
earth terminal receives simultaneously from adjacent satellites, but
does not switch between them, a 30-minute required clear time between
outages of the adjacent satellites leads to a greater estimated
satellite spacing. The elapsed time for the center of a first (easterly)
outage region to depart an affected earth terminal and travel eastward
until reception is regained (a distance equal to the semi-minor
dimension; Section A.4) is approximately 7.6 min / 2 = 3.8 minutes. The
elapsed time for the center of a second outage region to approach the
same earth terminal is also 3.8 minutes, measured from onset of the
second outage. The sum of elapsed times and the required 30-minute clear
interval is 37.6 minutes. The minimum satellite spacing, scaled from
the 28.8-minute interval between arrivals of shadows at the terminal
for satellites spaced 6.5° (Section A.2), is approximately
(37.6 min / 28.8 min) × 6.5° = 8.5°.


A.5.3 Case 3-Minimum of 30 Minutes Free of United States Outages


An estimate of the satellite spacing required for a 30-minute clear 
interval between outages of earth terminals throughout the contiguous 
United States for the case without switching is desired. A time equiva- 
lent of the satellite spacing for a 30-minute clear interval between 
adjacent outages at a single earth terminal is about 37.6 minutes 
(Section A.5.2). A satellite spacing of 6.5° is necessary for simultaneous 
sun transit of a first satellite at the extreme eastern terminal and a 
second satellite at the extreme western terminal; a time equivalent of 
this spacing is approximately 28.8 minutes (Section A.2). The sum of 
these intervals, 66.4 minutes, accounts for transits of all terminals 
within the assumed 60° longitude span at 41° north latitude. The 
approximate minimum satellite spacing for a 30-minute clear interval
throughout the United States is (66.4 min / 28.8 min) × 6.5° = 15.0°.


A.5.4 Case 4-Minimum of 30 Minutes Free of Outages Throughout One
Time Zone


The time equivalent of spacing for a 30-minute clear interval at a
single terminal without switching is 37.6 minutes. The time equivalent
of spacing for simultaneous sun transits of adjacent satellites at eastern
and western terminals bounding a 15° time zone is approximately
(15°/60°) × 28.8 min = 7.2 minutes. The required interval is about 44.8
minutes, accounting for outage dimensions and all terminals within
one time zone. The resulting minimum satellite spacing is approximately
(44.8 min / 28.8 min) × 6.5° = 10.1°.
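All four cases apply the same scaling rule: the required interval, divided by the 28.8-minute reference interval, multiplies the 6.5° reference spacing. The following is a minimal sketch of that arithmetic. The constant and function names are illustrative choices, and the 7.6-minute outage duration is the value used in Case 2.

```python
# Sketch of the spacing estimates of Cases 1-4 (names are illustrative).
REF_SPACING_DEG = 6.5     # reference orbit spacing (Section A.2)
REF_INTERVAL_MIN = 28.8   # interval between outage onsets at that spacing
OUTAGE_MIN = 7.6          # approximate outage duration at one terminal
CLEAR_MIN = 30.0          # required clear interval


def spacing_deg(interval_min):
    """Scale the reference spacing by the required time interval."""
    return interval_min / REF_INTERVAL_MIN * REF_SPACING_DEG


# Case 1: switching between satellites, 30 minutes between switches.
case1 = spacing_deg(CLEAR_MIN)
# Case 2: no switching; add two half-outage durations (2 x 3.8 min).
case2 = spacing_deg(CLEAR_MIN + OUTAGE_MIN)
# Case 3: all terminals within the 60-degree span; add the full 28.8 min.
case3 = spacing_deg(CLEAR_MIN + OUTAGE_MIN + REF_INTERVAL_MIN)
# Case 4: one 15-degree time zone; add (15/60) x 28.8 min.
case4 = spacing_deg(CLEAR_MIN + OUTAGE_MIN + 15.0 / 60.0 * REF_INTERVAL_MIN)

for name, value in (("Case 1", case1), ("Case 2", case2),
                    ("Case 3", case3), ("Case 4", case4)):
    print(f"{name}: {value:.1f} deg")
```

Changing `CLEAR_MIN` shows how the four spacings grow together with the required clear interval.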


APPENDIX B 
Estimation of Minimum Required Space Diversity 


B.1 Minimum Orbit Inclinations for a Prescribed Coverage Region 


Figure 12 relates the latitude extremes of a desired coverage region
to limits of the sun’s apparent declination angle for which sun transits
of diversity satellites can affect transmissions. First, two conic figures
of revolution as described in Section A.3, having angular radius $\alpha$,
define the northernmost and southernmost outage regions for
geostationary satellite $S_m$. Minimum orbit inclinations for diversity
satellites $S_1$ and $S_2$ are estimated by geometric construction. Parallel
sun rays are assumed and atmospheric refraction is neglected for all but
extreme latitudes in the presence of fairly large angles of incidence.’

One approximation implicit in the figure is that the satellites occupy 
the same mean orbit longitude. This enables a highly simplified geo- 
metrical analysis and the uncertainty introduced is shown later to be 
insignificant. 


B.2 Determination of Minimum Orbit Inclinations 


For geostationary satellite $S_m$ in Fig. 12, apparent declination angle
limits $\delta_n$ and $\delta_s$ are calculated for which the satellite shadows
intercept north geographic latitude limits $\varphi_n$ and $\varphi_s$ of an
assumed coverage region in the Northern Hemisphere between points $P_n$
and $P_s$, respectively. Slant range segment $P_nS_m$ is determined from the
solution of oblique triangle $P_nOS_m$:


$$P_nS_m = \left[R^2 + (R+h)^2 - 2R(R+h)\cos\varphi_n\right]^{1/2}\ \text{km}, \qquad (24)$$

and

$$P_sS_m = \left[R^2 + (R+h)^2 - 2R(R+h)\cos\varphi_s\right]^{1/2}\ \text{km}. \qquad (25)$$


The declination angles corresponding to northern and southern
boundaries of the coverage region are


$$\delta_n = \cos^{-1}\left[\frac{(R+h)^2 + (P_nS_m)^2 - R^2}{2\,(P_nS_m)(R+h)}\right]\ \text{degrees S}, \qquad (26)$$

and

$$\delta_s = \cos^{-1}\left[\frac{(R+h)^2 + (P_sS_m)^2 - R^2}{2\,(P_sS_m)(R+h)}\right]\ \text{degrees S}, \qquad (27)$$


where the units designation S denotes angular displacement south from
the celestial equator.
The angle measuring bisector $P_mS_m$ is denoted by $\delta_m$, where


$$\delta_m = (\delta_n + \delta_s)_{\text{av}}\ \text{degrees S}. \qquad (28)$$


1968 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1970


Synchronous satellites $S_1$ and $S_2$ are shown in Fig. 12 located on
great circle C of a geocentered sphere of radius (R + h) whose plane
contains the mean geopolar axis and an assumed common satellite
meridian circle. The satellites are also assumed to be symmetrically
opposite and equidistant from the equatorial plane. The required
distance between parallel sun’s rays through satellites $S_1$ and $S_2$ having
mean apparent declination $\delta_m$ is determined by constructing segment
$O_1O_2$ perpendicular to $S_mP_m$ through $P_n$. The base of isosceles triangle
$O_1S_mO_2$ represents the required ray separation. Making the
approximation


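$$O_1S_m \approx O_2S_m \approx P_nS_m, \qquad (29)$$

and denoting the angle $O_1$-$S_m$-$O_2$ by $\gamma$,

$$\gamma = \delta_n - \delta_s + 2\alpha\ \text{degrees}. \qquad (30)$$

From the solution of an isosceles triangle,

$$O_1O_2 = \left[2\,(P_nS_m)^2\,(1 - \cos\gamma)\right]^{1/2}\ \text{km}. \qquad (31)$$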


Constructing segment $S_1'S_2'$ perpendicular to $S_mP_m$ through $S_m$, its
length is


$$S_1'S_2' = O_1O_2\ \text{km}. \qquad (32)$$

The length of chord $S_1S_2$ between the satellites on circle C is

$$S_1S_2 = S_1'S_2' / \cos\delta_m\ \text{km}. \qquad (33)$$


The total geocentered arc $\overset{\frown}{S_1S_2}$ on circle C corresponding to chord
$S_1S_2$ is found from the solution of isosceles triangle $S_1OS_2$ (not
illustrated). Note that


$$OS_1 = OS_2 = OS_m = (R+h)\ \text{km}. \qquad (34)$$

Then

$$\cos\left(\overset{\frown}{S_1S_2}\right) = \frac{2\,(OS_m)^2 - (S_1S_2)^2}{2\,(OS_m)^2}. \qquad (35)$$


Note from Fig. 12 that equal orbit inclinations $i_1$ and $i_2$ are determined
by the minimum geocentered angular displacements of synchronous
satellites $S_1$ and $S_2$ from the equatorial plane, necessary for avoiding
simultaneous sun-transit outages between latitudes $\varphi_n$ and $\varphi_s$. Hence,


$$i_1 = i_2 = \overset{\frown}{S_1S_2} / 2\ \text{degrees}. \qquad (36)$$


While the simplified geometry of Fig. 12 results from an assumption
that the satellites’ mean longitudes are identical, recall from Section 3.1
that the maximum satellite excursions are made to occur at the instant
of zenith transit viewed by an observer at each satellite’s longitude.
Thus, for earth terminals situated along the longitude meridian of
satellite $S_1$, and receiving from it, the minimum required orbit
inclination $i_1$ is identical to $i_2$. Very slightly increased inclinations
are necessary to accommodate receiving earth terminals far from this
longitude.


B.3 Correction for Longitude Span and Latitude Location of Coverage 
Region 

The maximum time difference $\Delta t_1$ between sun transit of a
geostationary satellite centered over the United States and observed
along its longitude, and sun transit of the same satellite observed at a
longitude displaced by ±30°, for a minimum latitude of 26°N is about
∓0.3 hour, allowing for earth rotation (Fig. 3; Section A.2). The magnitude
of accumulated time shift $\Delta t_2$ (civil time versus sidereal time) relating
positions of satellites at 0° to 0° at the vernal equinox, arising from
location of the affected coverage region north of the equator, is about
1 hour (Fig. 6). An approximate worst-case adjustment of orbit
inclinations providing the required displacement of diversity satellites
from the equator at times when sun transits would otherwise be observed is


$$i_1' = i_2' = \frac{i_1}{\cos\left[(|\Delta t_1| + |\Delta t_2|)(360^\circ / 24\ \text{h})\right]}\ \text{degrees}. \qquad (37)$$


B.4 Illustrative Calculation 


It is assumed that latitude limits $\varphi_n$ and $\varphi_s$ for the United States
coverage region to be cleared of outages are 49°N and 26°N, respectively.
A spherical earth model is assumed with radius R = 6373 km.
The height of the geostationary orbit h is assumed to be 35,900 km. A
conic sun-transit outage figure is assumed (Fig. 12), having a radius in
angular measure of $\alpha$ = 1°.

Numerical results are obtained using all preceding relationships: 


From equations (24) and (25),  $P_nS_m$ = 38,394 km,
                               $P_sS_m$ = 36,652 km.
From equations (26) and (27),  $\delta_n$ = 7.200°,
                               $\delta_s$ = 4.375°.
From equation (28),            $\delta_m$ = 5.788°.
From equation (30),            $\gamma$ = 4.825°.
From equation (31),            $O_1O_2$ = 3,232 km.
From equation (33),            $S_1S_2$ = 3,249 km.
From equation (34),            $OS_m$ = 42,273 km.
From equation (35),            $\cos(\overset{\frown}{S_1S_2})$ = 0.997047.
From equation (36),            $i_1 = i_2$ = 2.201°.
From equation (37), the adjusted worst-case inclination is approximately 2.3°.


For larger earth terminals, assuming $\alpha$ = 0.7° for 25-m antennas,
the corresponding worst-case minimum equal orbit inclination
providing the specified diversity is 2.045 degrees.
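The chain of relationships (24) through (37) can be checked numerically. The sketch below assumes the law-of-cosines forms of equations (24) to (27) and (35), takes $P_nS_m$ as the slant range in equation (31), and uses $\Delta t_1$ = 0.3 hour and $\Delta t_2$ = 1 hour from Section B.3. The function name and dictionary keys are illustrative only. Results agree with the tabulated values to within a few thousandths of a degree.

```python
import math


def min_inclination(lat_n_deg, lat_s_deg, alpha_deg,
                    R=6373.0, h=35900.0, dt1_h=0.3, dt2_h=1.0):
    """Sketch of equations (24)-(37); distances in km, angles in degrees."""
    rs = R + h  # geocentric orbit radius, equation (34)

    def slant_range(lat_deg):  # equations (24), (25)
        return math.sqrt(R * R + rs * rs
                         - 2.0 * R * rs * math.cos(math.radians(lat_deg)))

    def declination(p):  # equations (26), (27)
        return math.degrees(math.acos((rs * rs + p * p - R * R)
                                      / (2.0 * p * rs)))

    pn, ps = slant_range(lat_n_deg), slant_range(lat_s_deg)
    dn, ds = declination(pn), declination(ps)
    dm = 0.5 * (dn + ds)                                  # equation (28)
    gamma = dn - ds + 2.0 * alpha_deg                     # equation (30)
    o1o2 = math.sqrt(2.0 * pn * pn
                     * (1.0 - math.cos(math.radians(gamma))))  # equation (31)
    chord = o1o2 / math.cos(math.radians(dm))             # equations (32), (33)
    arc = math.degrees(math.acos((2.0 * rs * rs - chord * chord)
                                 / (2.0 * rs * rs)))      # equation (35)
    i = arc / 2.0                                         # equation (36)
    shift_deg = (abs(dt1_h) + abs(dt2_h)) * 360.0 / 24.0
    i_adj = i / math.cos(math.radians(shift_deg))         # equation (37)
    return {"PnSm_km": pn, "PsSm_km": ps, "delta_n_deg": dn,
            "delta_s_deg": ds, "i_deg": i, "i_adjusted_deg": i_adj}


for alpha in (1.0, 0.7):
    result = min_inclination(49.0, 26.0, alpha)
    print(alpha, {k: round(v, 3) for k, v in result.items()})
```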


B.5 Minimum Orbit Inclinations for Avoiding Serial Eclipses 


The earth’s shadow is assumed to be a circular cylinder with a 
diameter equal to the mean diameter of the earth. This amounts to 
neglecting atmospheric refraction and the distinction between the 
umbra and penumbra shadow regions. For satellites with batteries, 
the net radiation energy lost per eclipse corresponds to a time inte- 
gration of the actual solar-array power output. This is nearly the energy 
loss which would result if the solar source were completely obstructed 
while the satellite traversed the assumed cylindrical shadow. 

An approximate relationship between declination D and the orbit
arc eclipsed is illustrated by Figs. 13 and 14. The length of
geostationary orbit radius OS is (R + h) km, so that


$$O'S = (R+h)\,|\sin D|\ \text{km}. \qquad (38)$$

Angle $\gamma$ in Fig. 14 is thus determined:


$$\gamma = \sin^{-1}\left(\frac{O'S}{R}\right)\ \text{degrees}. \qquad (39)$$


Fig. 13—Simplified geometry describing satellite eclipses.

Fig. 14—Projection of points A, C upon right section of earth’s shadow through S.


The length of the chord intercept common to both the orbit and the
cylindrical earth shadow is determined by a normal projection of the
orbit upon a right circular section of the shadow through S. From
equations (38) and (39),


$$AC = 2R\cos\gamma = 2R\cos\left\{\sin^{-1}\left[\frac{(R+h)\,|\sin D|}{R}\right]\right\}\ \text{km}. \qquad (40)$$


If the fraction in brackets in equation (40) is smaller than unity for
a given declination D, an eclipse of the orbiting satellite is indicated.
For zero values of apparent declination, the chord AC is simply twice
the mean earth radius R.
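The eclipse condition can be sketched as follows, under the cylindrical-shadow assumption stated above. The satellite's offset from the shadow axis is taken as (R + h)|sin D|, and the chord is 2R cos γ with sin γ equal to that offset divided by R. The values of R and h are borrowed from Section B.4, and the function name is illustrative.

```python
import math

R = 6373.0    # mean earth radius, km (value assumed in Section B.4)
H = 35900.0   # geostationary orbit height, km (value assumed in Section B.4)


def shadow_chord_km(declination_deg):
    """Chord AC of the cylindrical shadow; 0.0 when no eclipse occurs."""
    offset = (R + H) * abs(math.sin(math.radians(declination_deg)))
    ratio = offset / R            # the bracketed fraction
    if ratio >= 1.0:
        return 0.0                # orbit passes outside the shadow cylinder
    gamma = math.asin(ratio)
    return 2.0 * R * math.cos(gamma)


# Largest declination magnitude for which an eclipse is still indicated.
limit_deg = math.degrees(math.asin(R / (R + H)))

for d in (0.0, 4.0, 8.0, 10.0):
    print(d, round(shadow_chord_km(d), 1))
print("eclipse limit, degrees:", round(limit_deg, 2))
```

Under these assumptions eclipses are indicated only while the sun's apparent declination is within roughly 8.7° of zero.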

The corresponding orbit arc AC is next calculated from the solution 
of oblique triangles: 


— — 2 2 
AC = cos” ee degrees. (41) 





Hence, the minimum space diversity in geocentered angular measure
necessary for avoiding serial satellite eclipses is identified numerically
with the maximum orbit arc intercept, occurring for D = 0°. From
equation (41), the maximum resulting geocentered angle, corresponding
to one earth diameter, is approximately 17.6°. Then each minimum
orbit inclination $i_1 = i_2$ necessary for avoiding serial eclipses in the
manner of Section 3.2 is approximately 17.6°/2 = 8.8°.

Finally, it is of interest to estimate the time required for the satellite
to traverse shadow arc AC. The interval $\Delta t_e$ is numerically equivalent
to the resulting arc fraction times the orbit period, corrected for the
earth’s revolution about the sun:


$$\Delta t_e = [1.002738]\,(24 \times 60\ \text{min}) \times \frac{\overset{\frown}{AC}}{360^\circ}\ \text{minutes}. \qquad (42)$$
Adaptive Predictive Coding of
Speech Signals 


By B.S. ATAL and M. R. SCHROEDER 
(Manuscript received December 13, 1968) 


We describe in this paper a method for efficient encoding of speech
signals, based on predictive coding. In this coding method, both the
transmitter and the receiver estimate the signal’s current value by linear
prediction on the previously transmitted signal. The difference between this
estimate and the true value of the signal is quantized, coded and
transmitted to the receiver. At the receiver, the decoded difference signal
is added to the predicted signal to reproduce the input speech signal.
Because of the nonstationary nature of the speech signals, an adaptive
linear predictor is used, which is readjusted periodically to minimize the
mean-square error between the predicted and the true value of the signals.

The predictive coding system was simulated on a digital computer. The
predictor parameters, comprising one delay and nine other coefficients
related to the signal spectrum, were readjusted every 5 milliseconds. The
speech signal was sampled at a rate of 6.67 kHz, and the difference signal
was quantized by a two-level quantizer with variable step size. Subjective
comparisons with speech from a logarithmic PCM encoder (log-PCM)
indicate that the quality of the synthesized speech signal from the predictive
coding system is approximately equal to that of log-PCM speech encoded
at 6 bits/sample.

Preliminary studies suggest that the binary difference signal and the
predictor parameters together can be transmitted at approximately 10
kilobits/second, which is several times less than the bit rate required for
log-PCM encoding with comparable speech quality.


I. INTRODUCTION 


The aim of efficient coding methods’ is to reduce the channel capacity 
required to transmit a signal with specified fidelity. To achieve this 
objective, it is often essential to reduce the redundancy of the
transmitted signal. One well-known procedure for reducing signal redundancy
is predictive coding.*’~° In predictive coding, redundancy is reduced
by subtracting from the signal that part which can be predicted from 
its past. For many signals, the first-order entropy of the difference 
signal is much smaller than the first-order entropy of the original 
signal; thus, the difference signal is better suited to memoryless encod- 
ing than the original signal. Predictive coding offers a practical way of 
coding signals efficiently without requiring large codebook memories. 

Many previous speech coding methods® have employed schemes 
which attempt to separate the contributions of the vocal excitation 
from that of the vocal-tract transmission function. The well-known 
channel vocoder of Dudley’ was the first attempt in this direction. Al- 
though vocoders can reproduce intelligible speech, there is appreciable 
loss in naturalness and speech quality. This degradation in speech 
quality arises from various operations in the vocoding process, which 
are either inaccurately performed or are based on certain idealized 
approximations of speech production and perception processes. 

The present paper describes a different approach®”® to encoding of 
speech signals, based on predictive coding, which avoids the difficulties 
encountered in vocoders and vocoder-like devices. Although predictive 
coding utilizes such well-known characteristics of speech signals as 
pitch and formant structure, its operation does not rely solely upon a 
rigid parameterization of the speech signal. That part of the speech 
signal which cannot be represented in terms of these characteristics is 
not discarded but suitably encoded and transmitted to the receiver 
where it is used in the synthesis of a close replica of the original speech 
waveform. 

Previous studies of predictive coding systems for speech signals’® 
have been limited to linear predictors with fixed coefficients. However, 
due to the nonstationary nature of the speech signals, a fixed predictor 
cannot predict the signal values efficiently at all times. For example, the 
speech waveform is approximately periodic during voiced portions; 
thus, a good prediction of the present value of the signal can be based 
on the value of the signal exactly one period earlier. However, the 
period of the speech signal varies with time. The predictor, therefore, 
must change with the changing period of the input speech signal. In 
the predictive coding system described below, the linear predictor is 
adaptive; it is readjusted periodically to match the time-varying
characteristics of the input speech signal. The parameters of the linear
predictor are optimized to obtain an efficient prediction in the sense that
the mean-square error between the predicted value and the true value
of the signal is minimum.


* Another name often used for this kind of encoding is Differential Pulse Code
Modulation.


II. PREDICTIVE CODING SYSTEM 


2.1 Description 


A block diagram illustrating the principle of predictive coding is
shown in Fig. 1. The input signal $s(t)$ is sampled at the Nyquist rate
to produce the samples $s_n$ of the signal. The predictor forms an estimate
$\hat{s}_n$ of the signal’s present value based on the past samples
$r_{n-1}, r_{n-2}, \ldots$ of the reconstructed signal at the transmitter. The
predicted value $\hat{s}_n$ of the signal is next subtracted from the signal
value $s_n$ to form the difference $\delta_n$, which is quantized, encoded, and
transmitted to the receiver. At the same time, the transmitted signal is
decoded at the transmitter and the signal reconstructed in exactly the
same manner as is done at the receiver. The reconstructed signal is then
used to predict the next sample of the input signal.

At the receiver, the transmitted signal is decoded and added to the
predicted value of the signal to form the samples $r'_n$ of the reconstructed
signal. The predictor used at the receiver is identical to the one employed
at the transmitter. The samples $r'_n$ of the reconstructed signal are finally
low-pass filtered to produce the output signal $r'(t)$.
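The loop just described can be sketched in a few lines. This is only an illustration under simplifying assumptions that are not part of the system described here: a fixed first-order predictor with coefficient 0.9, a uniform quantizer of unlimited range, and an error-free channel. All function and variable names are hypothetical.

```python
import math


def quantize(d, step):
    """Uniform mid-rise quantizer (an assumed stand-in for the quantizer)."""
    return step * (math.floor(d / step) + 0.5)


def transmit(samples, a=0.9, step=0.1):
    """Return the decoded differences sent on the channel and the
    transmitter's own copy of the reconstructed signal."""
    r_prev = 0.0
    codes, recon = [], []
    for s in samples:
        s_hat = a * r_prev           # prediction from the reconstructed past
        d = s - s_hat                # difference signal
        dq = quantize(d, step)       # quantized (decoded) difference
        r_prev = s_hat + dq          # local reconstruction, as at the receiver
        codes.append(dq)
        recon.append(r_prev)
    return codes, recon


def receive(codes, a=0.9):
    """Rebuild the signal from the decoded differences alone."""
    r_prev = 0.0
    out = []
    for dq in codes:
        r_prev = a * r_prev + dq
        out.append(r_prev)
    return out


signal = [math.sin(0.2 * n) for n in range(200)]
codes, recon = transmit(signal)
print(receive(codes) == recon)   # receiver tracks the transmitter's copy
print(max(abs(r - s) for r, s in zip(recon, signal)))
```

Because the predictor at the transmitter operates on the reconstructed signal rather than on the input, the receiver reproduces the transmitter's reconstruction exactly, and the reconstruction error never exceeds half a quantizer step in this sketch.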


2.2 Signal-to-Quantizing Noise Ratio 


Consider the predictive coding system shown in Fig. 1. Let $P_s$ be
the mean-square value of the input signal samples $s_n$, $P_\delta$ be the
mean-square value of the difference signal samples $\delta_n$, $P_q$ be the
mean-square value of the quantizing noise in the decoded difference signal
$\delta'_n$, and $P_n$ be the mean-square value of the quantizing noise in the
reconstructed signal $r'_n$. We will now show that, in the absence of digital
channel transmission errors, the signal-to-quantizing noise ratio $P_s/P_n$
of the reconstructed signal is given by


$$\frac{P_s}{P_n} = \frac{P_s}{P_\delta}\cdot\frac{P_\delta}{P_q}. \qquad (1)$$


Fig. 1—Block diagram of a predictive coding system.


In other words, the signal-to-quantizing noise ratio of the reconstructed
signal exceeds the signal-to-quantizing noise ratio of the decoded
difference signal by a factor equal to the ratio of the mean-square value of
the input signal to the mean-square value of the difference signal. The
predictive coding system is thus superior to a straight PCM system
whenever $P_s/P_\delta$ is much greater than 1. For a signal such as speech,
this is indeed true. The results obtained by computer simulation of
the predictive coding system (see Section 3.3) show that $P_s/P_\delta$ is
about 100 for speech signals. By using predictive coding, one could thus
expect improvement of about 20 dB in signal-to-quantizing noise ratio
over a PCM system using identical quantizing levels.

To prove equation (1), we will first show that the error between any
sample of the reconstructed signal and the corresponding sample of
the input signal is identical to the error introduced by the quantizer,
the encoder and the decoder.

The error $e_n$ between the sample $r'_n$ of the reconstructed signal and
the sample $s_n$ of the input signal is given by


$$e_n = r'_n - s_n. \qquad (2)$$


In the absence of digital channel transmission errors, we can replace $r'_n$
in equation (2) by $r_n$ and rewrite equation (2) as


$$e_n = (r_n - \hat{s}_n) - (s_n - \hat{s}_n). \qquad (3)$$

It is readily seen in Fig. 1 that

$$r_n = \delta'_n + \hat{s}_n \qquad (4)$$

and

$$\delta_n = s_n - \hat{s}_n. \qquad (5)$$

On combining equations (3), (4) and (5), one obtains

$$e_n = \delta'_n - \delta_n. \qquad (6)$$


The right side of equation (6) represents the error introduced by the
quantizer, the encoder, and the decoder. Thus, the error in the nth
sample of the reconstructed signal is identical to the error in the nth
sample of the decoded difference signal.
The signal-to-quantizing noise ratio of the reconstructed signal is
by definition $P_s/P_n$ and can be written as

$$\frac{P_s}{P_n} = \frac{P_s}{P_\delta}\cdot\frac{P_\delta}{P_n}. \qquad (7)$$

Since the mean-square value $P_n$ of the quantizing noise in the
reconstructed signal is identical to the mean-square value $P_q$ of the
quantizing noise in the decoded difference signal, $P_n$ on the right side of
equation (7) can be replaced by $P_q$, and one obtains

$$\frac{P_s}{P_n} = \frac{P_s}{P_\delta}\cdot\frac{P_\delta}{P_q}.$$
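Expressed in decibels, the result separates into a prediction gain and the signal-to-noise ratio of the quantizer acting on the difference signal. In the worked form below, $P_q$ denotes the noise power in the decoded difference signal and $P_n$ the noise power in the reconstructed signal (subscripts chosen here for illustration), and the ratio of about 100 is the simulation figure quoted above.

```latex
\begin{align*}
10\log_{10}\frac{P_s}{P_n}
  &= 10\log_{10}\frac{P_s}{P_\delta} + 10\log_{10}\frac{P_\delta}{P_q} \\
  &\approx 10\log_{10}(100) + 10\log_{10}\frac{P_\delta}{P_q}
   = 20\ \text{dB} + 10\log_{10}\frac{P_\delta}{P_q}.
\end{align*}
```

The first term is the improvement attributable to prediction; the second is what the same quantizer would achieve on a signal with the statistics of the difference signal.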


III. APPLICATION OF PREDICTIVE CODING TO SPEECH SIGNALS 


3.1 Linear Prediction of Speech Signals 


Two of the main causes of redundancy in speech are:


(i) Quasi-periodicity during voiced segments® and,
(ii) Lack of flatness of the short-time spectral envelope.°


The exact form of the predictor for the speech wave depends on the
model used to represent the human speech production process. A
reasonable model for the production of voiced speech sounds is obtained
by representing them as the output of a discrete linear time-varying
filter which is excited by a quasi-periodic pulse train (see Fig. 2). The
output of the linear filter at any sampling instant is a linear combination
of the past p output samples and the input. The number of past samples
p is given by twice the number of resonances (formants) of the vocal
tract which are contained in the frequency range of interest. For
example, in the case of speech signals band-limited to 3 kHz, it can be
assumed that there are typically three to four formants.’ A suitable
value of p is thus 8.


Fig. 2—Model for the production of voiced speech sounds.

Let $s_n$ and $U_n$ be the amplitudes of the output and input signals
(see Fig. 2) at the nth sampling instant. The nth output sample $s_n$ is
then given by


$$s_n = \sum_{k=1}^{p} a_k s_{n-k} + U_n, \qquad (8)$$


where

$$U_n = \beta U_{n-M}; \qquad (9)$$


M is the period of the excitation signal and $\beta$ takes account of the
variation of the amplitude of the input pulse train from one period to
the next. For natural speaking conditions, the period of the excitation
signal is usually below 15 milliseconds, and, as a first approximation,
the effect of time variation of the coefficients $a_k$ from one pitch period
to the next can be neglected. Under this assumption, we find


$$s_n - \beta s_{n-M} = \sum_{k=1}^{p} a_k\,(s_{n-k} - \beta s_{n-k-M}) + U_n - \beta U_{n-M}. \qquad (10)$$


Since $U_n = \beta U_{n-M}$, equation (10) reduces to

$$s_n = \beta s_{n-M} + \sum_{k=1}^{p} a_k\,(s_{n-k} - \beta s_{n-k-M}), \qquad (11)$$

which determines completely the structure of the linear predictor.
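The step from the production model to the predictor can be verified numerically. The sketch below synthesizes a signal from the recursion $s_n = \sum_k a_k s_{n-k} + U_n$ with $U_n = \beta U_{n-M}$, and then checks that $\beta s_{n-M} + \sum_k a_k (s_{n-k} - \beta s_{n-k-M})$ reproduces each sample once n is at least M + p. The two filter coefficients, the gain 0.9, the period of 40 samples, and the single-pulse excitation are arbitrary test values, not values from the study.

```python
def synthesize(a, beta, M, n_samples, first_period):
    """Production model: s_n = sum_k a_k s_{n-k} + U_n, U_n = beta*U_{n-M}.
    `first_period` holds the M excitation samples of the first period."""
    p = len(a)
    U = list(first_period) + [0.0] * (n_samples - M)
    for n in range(M, n_samples):
        U[n] = beta * U[n - M]
    s = [0.0] * n_samples
    for n in range(n_samples):
        acc = U[n]
        for k in range(1, p + 1):
            if n - k >= 0:
                acc += a[k - 1] * s[n - k]
        s[n] = acc
    return s


def predict(s, a, beta, M, n):
    """Predictor: beta*s_{n-M} + sum_k a_k (s_{n-k} - beta*s_{n-k-M})."""
    return beta * s[n - M] + sum(
        a[k - 1] * (s[n - k] - beta * s[n - k - M])
        for k in range(1, len(a) + 1))


a = [1.2, -0.5]            # stable two-pole filter (arbitrary test values)
beta, M = 0.9, 40
s = synthesize(a, beta, M, 200, [1.0] + [0.0] * (M - 1))
worst = max(abs(s[n] - predict(s, a, beta, M, n))
            for n in range(M + len(a), len(s)))
print(worst)               # limited only by floating-point rounding
```

For real speech the model holds only approximately, so the prediction error is small rather than zero; that residual is the difference signal that is quantized.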


A block diagram of the predictor as described by equation (11) is
shown in Fig. 3. The delay M as well as the parameters $a_1, a_2, \ldots, a_p$
and $\beta$ are variable and are readjusted periodically to match the
characteristics of the input speech signal. Ideally, the readjustment of the
predictor parameters need be done only when there are significant
changes in the characteristics of the speech signal. This implies that
the predictor should be readjusted at short intervals during transitions
and at long intervals during steady state portions of the speech signal
and, consequently, a long buffer storage is needed to ensure transmission
of parameters at a uniform rate on the channel. In order to avoid the
use of a long buffer storage, the predictor parameters were readjusted
at a fixed time interval in our study. This time interval was chosen
to be 5 milliseconds to ensure that the prediction be efficient even
during rapidly changing segments of the speech wave.


Fig. 3—Block diagram of the predictor for speech signals. [Two predictor
blocks: $P_1(z) = \beta z^{-M}$ and $P_2(z) = \sum_{n=1}^{8} a_n z^{-n}$.]

For unvoiced sounds, the quasi-periodic excitation $U_n$ in equation (8)
is replaced by a noise-like excitation. Generally speaking, the transfer
function of the filter for unvoiced sounds must include poles as well as
zeros. However, we find that for all practical purposes it is sufficient to
include only the effect of poles. Equation (11), thus, represents the
linear predictor for unvoiced sounds too if $\beta$ is assumed zero.


3.2 Determination of Predictor Parameters 


The predictor parameters are determined by minimizing the mean-square
error between the actual speech sample and its predicted value.
The predicted value $\hat{s}_n$ of the nth speech sample is given by


$$\hat{s}_n = \beta s_{n-M} + \sum_{k=1}^{p} a_k\,(s_{n-k} - \beta s_{n-k-M}). \qquad (12)$$


The prediction error sample $E_n$ is then given by


$$E_n = s_n - \hat{s}_n = (s_n - \beta s_{n-M}) - \sum_{k=1}^{p} a_k\,(s_{n-k} - \beta s_{n-k-M}). \qquad (13)$$


The mean-square prediction error $\langle E_n^2\rangle_{\text{av}}$ is given by

$$\langle E_n^2\rangle_{\text{av}} = \frac{1}{N}\sum_{n} E_n^2, \qquad (14)$$


where the sum extends over all the samples in the time interval during
which the predictor is to be optimum.

The problem of minimizing the mean-square error $\langle E_n^2\rangle_{\text{av}}$ by suitable
selection of the predictor parameters does not admit a straightforward
solution due to the presence of the delay parameter M in equation (13).
A sub-optimum solution was obtained by minimizing the total error
in two steps. First the parameters $\beta$ and M are determined such that
the error $E_1$, defined by


$$E_1 = \frac{1}{N}\sum_{n}(s_n - \beta s_{n-M})^2 = \langle (s_n - \beta s_{n-M})^2\rangle_{\text{av}}, \qquad (15)$$


is minimum. Using these values of $\beta$ and M, the mean-square error
$\langle E_n^2\rangle_{\text{av}}$ is minimized by a suitable choice of parameters $a_1, \ldots, a_p$.

To find the values of the parameters $\beta$ and M which minimize the
error $E_1$ as defined in equation (15), we first set the partial derivative
of $E_1$ with respect to $\beta$ equal to zero:


$$\frac{\partial E_1}{\partial \beta} = -2\,\langle (s_n - \beta s_{n-M})\,s_{n-M}\rangle_{\text{av}} = 0, \qquad (16)$$


where the $\langle\ \rangle_{\text{av}}$ indicates the averaging over all the samples in the
given 5-millisecond time segment during which the predictor is to be
optimum.

On solving for $\beta$ from equation (16), we obtain


$$\beta = \frac{\langle s_n s_{n-M}\rangle_{\text{av}}}{\langle s_{n-M}^2\rangle_{\text{av}}}. \qquad (17)$$

We next substitute the value of $\beta$ from equation (17) into equation (15).
After rearrangement of terms, we obtain

$$E_1 = \langle s_n^2\rangle_{\text{av}} - \frac{\langle s_n s_{n-M}\rangle_{\text{av}}^2}{\langle s_{n-M}^2\rangle_{\text{av}}}. \qquad (18)$$


Since the first term on the right side of equation (18) does not depend
on M, it can be omitted in finding the minimum value of the error.
Further, $E_1$ is minimum if the second term on the right side of equation
(18) is maximum. The optimum value of M is thus determined from
the location of the maximum of the normalized correlation coefficient
$\rho$ given by


$$\rho = \frac{\langle s_n s_{n-M}\rangle_{\text{av}}}{\left[\langle s_n^2\rangle_{\text{av}}\,\langle s_{n-M}^2\rangle_{\text{av}}\right]^{1/2}}, \qquad M > 0. \qquad (19)$$
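The first step of the two-step procedure can be sketched as a direct search. For each candidate delay M, the code forms the normalized correlation between the segment and the same segment M samples earlier, keeps the delay giving the largest value, and computes the gain as the ratio of the cross term to the energy of the delayed segment. The function name, argument names, and the synthetic test signal (period 50 samples, amplitude decaying by 0.95 per period) are hypothetical.

```python
import math


def best_delay_and_gain(s, start, length, m_min, m_max):
    """Search M in [m_min, m_max] for the largest normalized correlation
    over the segment s[start:start+length]; requires start >= m_max.
    Returns (M, beta, rho), or None if the segment carries no energy."""
    seg = range(start, start + length)
    energy = sum(s[n] * s[n] for n in seg)
    best = None
    for M in range(m_min, m_max + 1):
        cross = sum(s[n] * s[n - M] for n in seg)
        lag_energy = sum(s[n - M] * s[n - M] for n in seg)
        if energy == 0.0 or lag_energy == 0.0:
            continue
        rho = cross / math.sqrt(energy * lag_energy)
        if best is None or rho > best[2]:
            best = (M, cross / lag_energy, rho)   # gain from the same sums
    return best


# Synthetic quasi-periodic test signal (illustrative only).
base = [math.sin(2 * math.pi * n / 50) + 0.5 * math.sin(4 * math.pi * n / 50 + 1.0)
        for n in range(50)]
s = [0.95 ** (n // 50) * base[n % 50] for n in range(300)]

# 33 samples is roughly 5 milliseconds at a 6.67-kHz sampling rate.
print(best_delay_and_gain(s, 150, 33, 20, 90))
```

The search range is kept below twice the true period in this example, since a delay of two periods would fit an exactly periodic test signal equally well.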


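Next, the predictor parameters $a_1, \ldots, a_p$ are obtained such that
the mean-square error $\langle E_n^2\rangle_{\text{av}}$ as given in equation (14) with $\beta$ and M
fixed at their optimum values is minimum. Let


$$v_n = s_n - \beta s_{n-M}. \qquad (20)$$

The error $\langle E_n^2\rangle_{\text{av}}$ is then given by

$$\langle E_n^2\rangle_{\text{av}} = \left\langle \left(v_n - \sum_{k=1}^{p} a_k v_{n-k}\right)^{2} \right\rangle_{\text{av}}. \qquad (21)$$

The optimum values of the coefficients $a_1, \ldots, a_p$ which minimize
$\langle E_n^2\rangle_{\text{av}}$ are obtained by setting the partial derivatives of $\langle E_n^2\rangle_{\text{av}}$ with
respect to $a_1, \ldots, a_p$ equal to zero. Or,


$$\frac{\partial \langle E_n^2\rangle_{\text{av}}}{\partial a_j} = -2\left\langle \left(v_n - \sum_{k=1}^{p} a_k v_{n-k}\right) v_{n-j} \right\rangle_{\text{av}} = 0 \quad \text{for } j = 1, 2, \ldots, p. \qquad (22)$$

Equation (22) can be rewritten in matrix notation as

$$\Phi\,\mathbf{a} = \boldsymbol{\psi}, \qquad (23)$$

where $\Phi$ is a p by p matrix with its (ij)th term $\varphi_{ij}$ given by

$$\varphi_{ij} = \langle v_{n-i}\,v_{n-j}\rangle_{\text{av}}, \qquad (24)$$


$\mathbf{a}$ is a p-dimensional vector whose jth component is $a_j$, and $\boldsymbol{\psi}$ is a
p-dimensional vector whose jth component $\psi_j$ is given by


$$\psi_j = \langle v_n\,v_{n-j}\rangle_{\text{av}}. \qquad (25)$$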


The optimum predictor coefficients $a_1, a_2, \ldots, a_p$ are obtained by
solving equation (23) for $\mathbf{a}$. For the case when $\Phi$ is a nonsingular matrix,
the solution of equation (23) presents no difficulty. The vector $\mathbf{a}$ can
be obtained by multiplying $\boldsymbol{\psi}$ with the inverse of the matrix $\Phi$. A more
efficient computational procedure’ for solving equation (23), which
does not involve matrix inversion, takes advantage of the fact that $\Phi$
is a symmetric matrix (and, being formed from averaged products, a
non-negative definite one), and thus can be expressed as the product of a
triangular matrix and its transpose. Equation (23) can then be written
as three separate matrix equations. These equations involve triangular
matrices only and their solutions can be expressed by a set of recursive
equations.”

A singular $\Phi$ matrix implies that one or more of its eigenvalues is
zero. The matrix $\Phi$ can be modified to become nonsingular by adding a
small positive constant to its diagonal elements. Equation (23) is
solved again with the matrix $\Phi$ replaced by the modified matrix $\Phi'$. The
modified matrix $\Phi'$ is symmetric and has the same eigenvectors as the
matrix $\Phi$, but its eigenvalues are all positive; thus it is a positive
definite symmetric matrix and has a unique inverse $\Phi'^{-1}$.
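A sketch of the second step follows. It forms the matrix of averaged products and the right-hand vector from a sequence $v_n$, then solves the linear system by a Cholesky factorization followed by two triangular solves. This plain Cholesky form is a substitute chosen here for brevity and is not necessarily the three-equation recursive procedure cited above. The `loading` argument is the small positive constant added to the diagonal for the singular case. All names are hypothetical.

```python
import math


def normal_equations(v, start, length, p):
    """phi[i][j] = <v_{n-i} v_{n-j}>, psi[j] = <v_n v_{n-j}> for i, j = 1..p,
    averaged over n in [start, start+length); requires start >= p."""
    seg = range(start, start + length)
    phi = [[sum(v[n - i] * v[n - j] for n in seg) / length
            for j in range(1, p + 1)] for i in range(1, p + 1)]
    psi = [sum(v[n] * v[n - j] for n in seg) / length
           for j in range(1, p + 1)]
    return phi, psi


def cholesky_solve(phi, psi, loading=0.0):
    """Solve (phi + loading*I) a = psi without forming an inverse."""
    p = len(psi)
    A = [[phi[i][j] + (loading if i == j else 0.0) for j in range(p)]
         for i in range(p)]
    L = [[0.0] * p for _ in range(p)]
    for i in range(p):
        for j in range(i + 1):
            acc = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                if acc <= 0.0:
                    raise ValueError("not positive definite; increase loading")
                L[i][i] = math.sqrt(acc)
            else:
                L[i][j] = acc / L[j][j]
    y = [0.0] * p                     # forward substitution: L y = psi
    for i in range(p):
        y[i] = (psi[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    a = [0.0] * p                     # back substitution: L^T a = y
    for i in reversed(range(p)):
        a[i] = (y[i] - sum(L[k][i] * a[k] for k in range(i + 1, p))) / L[i][i]
    return a


# Test sequence obeying v_n = 1.2 v_{n-1} - 0.5 v_{n-2} exactly.
v = [1.0, 0.5]
for n in range(2, 30):
    v.append(1.2 * v[n - 1] - 0.5 * v[n - 2])
phi, psi = normal_equations(v, 2, 20, 2)
print(cholesky_solve(phi, psi))       # close to [1.2, -0.5]
```

With a constant sequence the matrix has identical rows and is singular; calling `cholesky_solve` with a small positive `loading` then returns a finite solution, which is the diagonal modification described above.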


3.3 Computer Simulation of the System 


The predictive coding system using adaptive predictors was simulated
on a digital computer to determine its effectiveness for coding
speech signals. The transmitter and the receiver are illustrated
separately in Figs. 4 and 5, respectively. The sampling rate used in this
simulation was 6.67 kHz. Prior to sampling, the input speech signal
was filtered with a low-pass filter with 3-dB attenuation at 3.1 kHz
and an attenuation of 40 dB or more for frequencies above 3.33 kHz.
At the transmitter, the difference $\delta_n$ formed by subtracting the
predicted value $\hat{s}_n$ from the speech sample $s_n$ was quantized by a
two-level (1 bit) quantizer with variable step size q. The parameter q was
readjusted every 5 milliseconds to yield minimum quantization noise
power. The parameters of the adaptive predictor were also computed
once every 5 milliseconds and sent to the receiver together with the
binary difference signal and the step size q of the quantizer. The
optimum value of the delay parameter M was obtained by locating the
maximum of the correlation coefficient $\rho$ as defined in equation (19)
for values of M between 20 and 150. The parameter p was set at 8.


Fig. 4—Transmitter of the predictive coding system.

The speech signal was reconstructed at the receiver by a feedback
loop containing an adaptive predictor identical to the one used at the
transmitter. Here, too, the predictor was reset every 5 milliseconds
according to the predictor-parameter information received from the
transmitter. The reconstructed speech samples were finally smoothed
by a 3.1-kHz low-pass filter to form the output speech signal $r'(t)$.


Fig. 5—Receiver of the predictive coding system. [Blocks: digital channel,
decoder, adaptive predictor driven by the received predictor parameters,
and 3.1-kHz low-pass filter producing the output speech signal.]
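How the step size q was chosen is not spelled out beyond the requirement of minimum quantization noise power. As an illustration only, consider the open-loop case in which the difference samples of a segment are treated as fixed. A two-level quantizer with outputs +q and -q then has noise power ⟨δ²⟩ - 2q⟨|δ|⟩ + q², which is smallest when q equals the mean absolute value of the differences. In the actual closed loop the differences themselves depend on q, so this is an approximation and not the procedure used in the simulation. All names are hypothetical.

```python
def two_level_quantize(d, q):
    """One-bit quantizer: output +q for d >= 0, otherwise -q."""
    return q if d >= 0 else -q


def best_step_open_loop(diffs):
    """Step size minimizing the noise power for a fixed set of differences."""
    return sum(abs(d) for d in diffs) / len(diffs)


def noise_power(diffs, q):
    """Mean-square error between the differences and their quantized values."""
    return sum((d - two_level_quantize(d, q)) ** 2 for d in diffs) / len(diffs)


diffs = [0.3, -0.1, 0.5, -0.7, 0.2]   # arbitrary example values
q_opt = best_step_open_loop(diffs)
for q in (0.9 * q_opt, q_opt, 1.1 * q_opt):
    print(round(q, 3), round(noise_power(diffs, q), 5))
```

At 6.67 kHz with one bit per sample, the binary difference signal alone accounts for 6.67 kilobits/second; the step size and predictor parameters are sent in addition.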


IV. RESULTS OF SUBJECTIVE TESTS 


Two different subjective tests were conducted to judge the quality
of the reconstructed speech signal produced at the receiver of the
predictive coding system. In the first test, trained listeners compared the
reconstructed speech signal with speech from a logarithmic PCM
(log-PCM) encoder that used the same input signals and a sampling
frequency of 6.67 kHz. The compression characteristic employed in a
log-PCM encoder is defined by the equation

$$y = \frac{V \log\left[1 + \mu |x| / V\right]}{\log(1 + \mu)} \, \operatorname{sgn} x, \tag{26}$$

where y represents the output voltage corresponding to an input signal
voltage x, $\mu$ is a dimensionless parameter which determines the degree
of compression, and V is the compressor overload voltage. The compressed
signal y was quantized at bit rates varying from 5 bits/sample
to 7 bits/sample with $\mu = 100$ and V = 8 times the rms speech signal
voltage.† Speech samples from both male and female speakers were
used in these tests. The results of the subjective tests indicated that
the quality of the reconstructed speech signal was better than that of
log-PCM speech with 5 bits/sample but slightly inferior to that with
6 bits/sample. The corresponding measured signal-to-noise ratios for
log-PCM speech were 21 dB and 27 dB, respectively.

In the second test, the reconstructed speech signal was compared
with the input speech signal contaminated by additive white noise
obtained by randomly inverting the polarity of successive Nyquist
samples of the input speech signal. This noise is subjectively similar
to the distortion introduced by predictive coding and is therefore
particularly appropriate for reproducible comparisons. This noise has
an added advantage in that its absolute amplitude at any instant of
time is proportional to the absolute amplitude of the input speech
signal. This proportionality permits the calculation of a precise
signal-to-noise ratio (S/N). Based on the results of these tests, the
equivalent S/N of the reconstructed speech in the predictive coding system
described above was found to be about 25 dB, which is in good agreement
with results obtained by the subjective comparison with log-PCM.

† The integration time for computing the rms value of the speech signal was
several seconds and included speech samples from a number of speakers.
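The log-PCM compression characteristic of equation (26) can be sketched in a few lines. The compressor follows the equation directly; the expander, the clipping at the overload voltage, and the uniform mid-rise quantizer are assumptions added for illustration, since the text does not specify them. All function names are hypothetical.

```python
import math


def mu_law_compress(x, V, mu=100.0):
    """Equation (26): y = V * log(1 + mu*|x|/V) / log(1 + mu) * sgn(x)."""
    magnitude = min(abs(x), V)  # assumption: inputs are clipped at the overload voltage
    y = V * math.log1p(mu * magnitude / V) / math.log1p(mu)
    return math.copysign(y, x)


def mu_law_expand(y, V, mu=100.0):
    """Inverse of the compressor: |x| = (V/mu) * ((1 + mu)**(|y|/V) - 1)."""
    magnitude = (V / mu) * math.expm1(abs(y) / V * math.log1p(mu))
    return math.copysign(magnitude, y)


def quantize_uniform(y, V, bits):
    """Uniform mid-rise quantizer with 2**bits levels on [-V, V] (assumed form)."""
    levels = 2 ** bits
    step = 2.0 * V / levels
    index = min(levels - 1, max(0, int(math.floor((y + V) / step))))
    return -V + (index + 0.5) * step


def log_pcm(x, V, bits, mu=100.0):
    """Compress, quantize, and expand one sample."""
    return mu_law_expand(quantize_uniform(mu_law_compress(x, V, mu), V, bits), V, mu)


rms = 1.0          # illustrative rms level
V = 8.0 * rms      # overload voltage used in the tests described above
decoded = [log_pcm(x, V, bits=6) for x in (-2.0, -0.1, 0.05, 1.0, 7.5)]
```

Because the quantizer acts on the compressed value y, the step size seen by the input x grows with |x|, which is the purpose of the logarithmic characteristic.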


V. ADDITIONAL MODIFICATIONS OF THE PREDICTIVE CODING SYSTEM


5.1 Spectrum of Quantizing Noise and Its Influence on the Subjective 
Quality of the Reconstructed Speech 


For frequencies above 500 Hz, the frequency spectrum of voiced
speech sounds generally falls off with frequency with an average slope
between -6 and -12 dB per octave. The spectrum of quantizing
noise in the predictive coding system, on the other hand, is
approximately uniform. The signal-to-quantizing noise ratio (S/N) of the
reconstructed speech thus also falls off with frequency. This is
illustrated in Fig. 6, where the spectrum of a short segment of the speech
signal is compared with the spectrum of the corresponding quantizing
noise. As can be seen, the S/N is very poor at high frequencies.
Informal listening tests of the reconstructed speech appeared to confirm
the above observation. The quality of the reconstructed speech can
thus be improved by a suitable shaping of the spectrum of the quantizing
noise so that the S/N is more or less uniform over the entire
frequency range of the input speech signal. The desired spectral shaping
can be achieved by pre-emphasizing the input speech signal at high
frequencies by means of a fixed filter whose amplitude versus frequency
characteristic rises with frequency above 500 Hz with a
slope of 12 dB per octave. The spectral distortion can finally be
eliminated by a filter at the output of the receiver whose amplitude versus
frequency characteristic is the exact inverse of that of the pre-emphasis
filter. The results of computer simulation indicate that the quality of
the reconstructed speech in the predictive coding system employing
pre-emphasis is considerably better than that of the system without
pre-emphasis.
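A pre-emphasis and de-emphasis pair of the kind described above might be sketched as follows. The paper does not give the filter design, so everything here is an assumption: two cascaded first-order digital sections (giving a rise that approaches 12 dB per octave), a coefficient chosen so that the corner falls near 500 Hz at a 6.67-kHz sampling rate, and unity gain at zero frequency. The point of the sketch is only that the receiver filter undoes the transmitter filter exactly.

```python
import math

FS = 6670.0   # sampling frequency in Hz, as used elsewhere in the paper
FC = 500.0    # corner frequency in Hz
A = math.exp(-2.0 * math.pi * FC / FS)  # assumed mapping of the corner to a filter zero


def pre_emphasis_section(x, a=A):
    """One first-order section: y[n] = (x[n] - a*x[n-1]) / (1 - a)."""
    y, previous = [], 0.0
    for value in x:
        y.append((value - a * previous) / (1.0 - a))
        previous = value
    return y


def de_emphasis_section(y, a=A):
    """Exact inverse of one section: x[n] = (1 - a)*y[n] + a*x[n-1]."""
    x, previous = [], 0.0
    for value in y:
        previous = (1.0 - a) * value + a * previous
        x.append(previous)
    return x


def emphasize(x):
    """Transmitter side: two cascaded pre-emphasis sections."""
    return pre_emphasis_section(pre_emphasis_section(x))


def restore(y):
    """Receiver side: two cascaded de-emphasis sections."""
    return de_emphasis_section(de_emphasis_section(y))


def gain_db(freq_hz):
    """Magnitude response of the two-section pre-emphasis filter, in dB."""
    w = 2.0 * math.pi * freq_hz / FS
    magnitude = abs(1.0 - A * complex(math.cos(w), -math.sin(w))) / (1.0 - A)
    return 40.0 * math.log10(magnitude)


speech_like = [math.sin(0.3 * n) + 0.5 * math.sin(1.1 * n) for n in range(200)]
recovered = restore(emphasize(speech_like))
```

Quantizing noise added between `emphasize` and `restore` is attenuated at high frequencies by the receiver filter, which is how the noise spectrum is shaped to follow the speech spectrum.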


5.2 Improved Prediction of Voiced Speech 


The redundancy due to the quasi-periodic nature of voiced speech
is removed in the predictive coding system described earlier by a
predictor $P_1(z)$ consisting of a delay of M samples and an amplifier with
gain $\beta$ as shown in Fig. 3. It is possible to improve the prediction of
voiced speech by employing a predictor $P_1(z)$ consisting of two delays
and two amplifiers such that

$$P_1(z) = \beta_1 z^{-M_1} + \beta_2 z^{-M_2}. \tag{27}$$

The parameters $\beta_1$ and $\beta_2$ are calculated by minimizing the
mean-square error $E_1$ defined by

$$E_1 = \left\langle \left( s_n - \beta_1 s_{n-M_1} - \beta_2 s_{n-M_2} \right)^2 \right\rangle_{\mathrm{av}}. \tag{28}$$


The modified predictive coding system including pre-emphasis of the 
input speech signal together with the second-order predictor $P_1(z)$ as
given in equation (27) was simulated on the computer. The results of 
subjective tests similar to those described in Section IV indicated that 
the quality of the reconstructed speech was somewhat superior to that 
of log-PCM speech at 6 bits per sample. The equivalent S/N was found 
to be 30 dB. 
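Minimizing a mean-square error of the form in equation (28) over two gains leads to a pair of linear (normal) equations, which can be solved in closed form. The sketch below assumes that the two delays are already known; the names `m1` and `m2`, the frame length, and the synthetic signal are hypothetical and are not taken from the paper.

```python
import random


def two_tap_gains(s, start, length, m1, m2):
    """Least-squares gains (b1, b2) for predicting s[n] from s[n-m1] and s[n-m2]."""
    saa = sab = sbb = sxa = sxb = 0.0
    for n in range(start, start + length):
        x, a, b = s[n], s[n - m1], s[n - m2]
        saa += a * a
        sab += a * b
        sbb += b * b
        sxa += x * a
        sxb += x * b
    det = saa * sbb - sab * sab
    if det == 0.0:
        raise ValueError("delayed signals are linearly dependent over this frame")
    # Cramer's rule on the 2x2 normal equations.
    b1 = (sxa * sbb - sxb * sab) / det
    b2 = (sxb * saa - sxa * sab) / det
    return b1, b2


def mean_square_error(s, start, length, m1, m2, b1, b2):
    """Average of (s[n] - b1*s[n-m1] - b2*s[n-m2])**2 over the frame."""
    total = 0.0
    for n in range(start, start + length):
        residual = s[n] - b1 * s[n - m1] - b2 * s[n - m2]
        total += residual * residual
    return total / length


# Synthetic signal that obeys a two-tap recursion exactly (illustrative values).
rng = random.Random(0)
M1, M2 = 40, 41
signal = [rng.uniform(-1.0, 1.0) for _ in range(M2)]
for n in range(M2, 400):
    signal.append(0.6 * signal[n - M1] + 0.3 * signal[n - M2])

beta1, beta2 = two_tap_gains(signal, start=100, length=200, m1=M1, m2=M2)
error = mean_square_error(signal, 100, 200, M1, M2, beta1, beta2)
```

On real speech the residual would not vanish; the synthetic signal is built to satisfy the recursion so that the recovered gains can be checked against known values.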


VI. QUANTIZATION OF PREDICTOR PARAMETERS 


No attempt was made in the study reported here to quantize the 
predictor parameters. Preliminary calculations were made to estimate 
the number of bits required to transmit the information to the receiver. 
Since the predictor parameters (one delay and nine other coefficients) 
carry the information about the signal spectrum, it should be possible 
to encode them at a bit rate comparable to one used in conventional 
formant vocoders. This suggests a bit rate of approximately 10 kilobits 
per second for transmitting the binary difference signal (6.67 kb/s) 
and the predictor parameters (3 kb/s). Recent studies by Kelly 
indicate that it is indeed possible to encode the transmitted informa- 
tion within 9600 b/s without significant loss in speech quality. 
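The bit-rate estimate above is simple arithmetic, set out here as a check. The comparison with log-PCM at 6 bits per sample is an added illustration based on the quality comparison in Section 5.2; the variable names are hypothetical.

```python
SAMPLING_RATE_HZ = 6670          # 6.67 kHz sampling frequency
BITS_PER_DIFFERENCE_SAMPLE = 1   # two-level quantizer

# Binary difference signal: one bit per sample.
difference_rate = SAMPLING_RATE_HZ * BITS_PER_DIFFERENCE_SAMPLE   # 6670 b/s

# Estimate for the predictor parameters quoted in the text.
parameter_rate = 3000                                             # 3 kb/s

total_rate = difference_rate + parameter_rate                     # about 10 kb/s

# Log-PCM at 6 bits per sample and the same sampling rate, for comparison.
log_pcm_rate = SAMPLING_RATE_HZ * 6
ratio = log_pcm_rate / total_rate

print(total_rate, log_pcm_rate, round(ratio, 2))
```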
VII. CONCLUSIONS


The study reported here shows that predictive coding is a promising 
approach to digital encoding of speech signals for high-quality trans- 
mission at substantial reductions in bit rate. Unlike past speech coding 
methods based on the vocoder principle, the predictive coding scheme 
described here attempts to reproduce accurately the speech waveform, 
rather than its spectrum. Listening tests show that there is only slight, 
often imperceptible, degradation in the quality of the reproduced 
speech. Although no detailed investigation of the optimum encoding 
methods of the predictor parameters was made, preliminary studies 
suggest that the binary difference signal and the predictor parameters 
together can be transmitted at bit rates below 10 kb/s, several
times lower than the bit rate required for PCM encoding with comparable
speech quality.
All Terminal Bubbles Programs Yield the Elementary Symmetric Polynomials


By R. P. KURSHAN 
(Manuscript received May 18, 1970) 


R. L. Graham has discussed various combinatorial aspects of the
behavior of magnetic domains or "bubbles" (Ref. 1). Representing the initial
state of a configuration of n magnetic domains by the n-tuple of
indeterminates $B = (X_1, \ldots, X_n)$, he showed that subsequent
configurations of magnetic domains obtainable (within the constraints of the
problem) correspond exactly to subsequent n-tuples of Boolean
expressions in the $X_i$'s* obtainable from B through an application to B
of a product of transformations ("commands" in Ref. 1) of the form
$T_{ij}$ $(1 \le i < j \le n)$ where if $P = (P_1, \ldots, P_n)$ is an n-tuple
of Boolean expressions in the $X_i$'s, then $T_{ij}(P) = (Q_1, \ldots, Q_n)$,

$$Q_k = \begin{cases} P_i \cup P_j & \text{if } k = i, \\ P_i \cap P_j & \text{if } k = j, \\ P_k & \text{otherwise,} \end{cases} \qquad k = 1, \ldots, n.$$
Furthermore, he showed that

if $\mathcal{T}$ is an $\binom{n}{2}$-fold product of such transformations
and if $\mathcal{U}$ is any other, then
$(\mathcal{U} \circ \mathcal{T})(B) = \mathcal{T}(B)$.    (T)


This provides a limitation on the number of distinct n-tuples of the
form $\mathcal{U}(B) = (P_1, \ldots, P_n)$ where $\mathcal{U}$ is a product of
transformations, and hence provides a limitation on the number of distinct
$P_i$'s thus obtainable from various $\mathcal{U}$'s. Graham showed that for
n = 11, this limitation implies that not all Boolean expressions in the
$X_i$'s are realizable as a $P_i$.

This led to an (as yet unsuccessful) attempt to characterize those
expressions which are realizable. The purpose of this note is to observe a
fragmentary result in this direction: that if $\mathcal{T}$ is as above, then
$\mathcal{T}(B) = (S_1, \ldots, S_n)$ where $S_i$ is the elementary symmetric
polynomial in $X_1, \ldots, X_n$ of degree i (here interpreting $\cup$ as $+$
and $\cap$ as $\cdot$). The situation will be rephrased in terms of a
semiring.

* A Boolean expression in the $X_i$'s is either a term of the form $X_i$
$(1 \le i \le n)$, a term of the form $P \cup Q$, or a term of the form
$P \cap Q$, where both P and Q are Boolean expressions in the $X_i$'s;
expressions may be reduced as if the $X_i$'s were sets.

For a fixed n let R be the (Boolean) commutative semiring generated
by $X_1, \ldots, X_n$ subject to the relations:

for $i = 1, \ldots, n$,   (1) $X_i^2 = X_i$,
                          (2) $f X_i + f = f$ for all $f \in R$.

It follows that $2X_i = X_i$ $(i = 1, \ldots, n)$ and hence, each $f \in R$ is
a Boolean polynomial in the indeterminates $X_1, \ldots, X_n$ (that is, the
$X_i$'s behave like sets with respect to $+$ and $\cdot$ interpreted as
$\cup$ and $\cap$ respectively).

Throughout, if $x \in R^n$ (the set of n-tuples of elements of R), then
for $1 \le k \le n$, $x_k$ will denote the kth coordinate of x, that is,
$x = (x_1, \ldots, x_k, \ldots, x_n)$. Let T (or $T_n$) be the set of
transpositions of $\{1, \ldots, n\}$ and for $t \in T$, say $t = (i, j)$,
$i < j$, define $t: R^n \to R^n$ by

$$(tf)_k = \begin{cases} f_i + f_j & \text{if } k = i, \\ f_i f_j & \text{if } k = j, \\ f_k & \text{otherwise.} \end{cases}$$

Let $B = B_n = (X_1, \ldots, X_n) \in R^n$ and set
$\mathcal{C}_n = \bigcup_{r=0}^{m} T^r(B)$ where $m = \binom{n}{2}$* and
$T^r = \{t_1 t_2 \cdots t_r \mid t_1, t_2, \ldots, t_r \in T\}$.
A point $C \in \mathcal{C}_n$ is said to be terminal if $t(C) = C$ for all
$t \in T$. It is not hard to see that $(S_1, \ldots, S_n)$ is a terminal
element of $\mathcal{C}_n$ where $S_i$ $(1 \le i \le n)$ is the elementary
symmetric polynomial in $X_1, \ldots, X_n$ of degree i; in what follows it
will be shown that this characterizes the terminal elements of
$\mathcal{C}_n$.
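The action of a transposition on an n-tuple can be tried out on a small case. The sketch below is an illustration under a modeling assumption that is not part of the note: each element of R is represented by its truth table (a bitmask over the 2^n assignments of the X's), so that + becomes bitwise OR and the product becomes bitwise AND. The function names and the particular bubble-sort ordering of transpositions are hypothetical choices.

```python
from itertools import combinations


def variables(n):
    """Truth tables of X_1, ..., X_n as bitmasks over the 2**n assignments."""
    return tuple(
        sum(1 << a for a in range(2 ** n) if (a >> i) & 1)
        for i in range(n)
    )


def apply_transposition(f, i, j):
    """Apply t = (i, j), i < j (1-based): f_i <- f_i + f_j and f_j <- f_i * f_j."""
    assert 1 <= i < j <= len(f)
    g = list(f)
    g[i - 1] = f[i - 1] | f[j - 1]
    g[j - 1] = f[i - 1] & f[j - 1]
    return tuple(g)


def elementary_symmetric(n, k):
    """S_k as a truth table: true exactly when at least k of the X's are true."""
    return sum(1 << a for a in range(2 ** n) if bin(a).count("1") >= k)


def is_terminal(f):
    """True if every transposition leaves f unchanged."""
    indices = range(1, len(f) + 1)
    return all(apply_transposition(f, i, j) == f
               for i, j in combinations(indices, 2))


def bubble_program(n):
    """One product of n(n-1)/2 adjacent transpositions, in bubble-sort order."""
    return [(i, i + 1) for p in range(n - 1) for i in range(n - 1, p, -1)]


n = 4
state = variables(n)
for i, j in bubble_program(n):
    state = apply_transposition(state, i, j)
symmetric = tuple(elementary_symmetric(n, k) for k in range(1, n + 1))
print(state == symmetric, is_terminal(state))
```

In this model a sum of all products of k distinct X's is true exactly when at least k of the X's are true, which is why `elementary_symmetric` is written as a threshold test.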

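The elements of R may be partially ordered by
$f \le g \Leftrightarrow f + g = g$. For $D \in R^n$, $1 \le j \le n$, define
$D^j \in R^n$ by

$$D_i^j = D_i(X_1, \ldots, X_{j-1}, 0, X_{j+1}, \ldots, X_n), \qquad 1 \le i \le n.$$


Lemma 1: C is terminal $\Leftrightarrow$ $C_1 \ge C_2 \ge \cdots \ge C_n$.


Proof: Obvious.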


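* By (T), $\mathcal{C}_n = \bigcup_{k=0}^{\infty} T^k(B)$; on the other hand
$\mathcal{C}_n = \bigcup_{k=0}^{r} T^k(B) \Rightarrow r \ge m$: using notation
developed below, this can be proved by induction on n as follows.
If n = 1 it is clear; assuming it is true for a given n, identify
$\mathcal{C}_n$ with $\{D^{n+1} \mid D \in \mathcal{C}_{n+1}\} \subset \mathcal{C}_{n+1}$
(see remark following Lemma 3). Using the theorem below and the
induction hypothesis, there is a $\mathcal{T}$ such that
$\mathcal{T}(B_{n+1}) = (S_1^{n+1}, S_2^{n+1}, \ldots, S_n^{n+1}, X_{n+1})$,
and $\mathcal{T}$ is a product of at least $\binom{n}{2}$ transpositions. Let
$\mathcal{T}' = (1\ 2)(2\ 3) \cdots (n\ \ n+1)\,\mathcal{T}$; then
$\mathcal{T}'(B_{n+1}) = (S_1, \ldots, S_{n+1})$, $\mathcal{T}'$ is a product of
$\binom{n}{2} + n = \binom{n+1}{2}$ transpositions and if for some
$\mathcal{U}$, $(\mathcal{U}\mathcal{T})(B_{n+1}) = \mathcal{T}'(B_{n+1})$, then
$\mathcal{U}$ must be a product of at least n transpositions.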
Lemma 2: If $f, g \in R$ are such that $X_i$ divides no summand of either,
then $f + X_i h_1 = g + X_i h_2 \Rightarrow f = g$.


Proof: Writing $f + X_i h_1$ as a sum of products of $X_k$'s, both f and g
are precisely the sum of those products which are not divisible by $X_i$.


Lemma 3: If $D \in \mathcal{C}_n$, then for each $j = 1, \ldots, n$ there
exists i such that $D_i^j = 0$.


Proof: Assume $D \in \mathcal{C}_n$ and $1 \le j \le n$. Find
$t_1, \ldots, t_r \in T$ such that $tB = D$ where $t = t_r t_{r-1} \cdots t_1$.
If r = 1, say $t = (\alpha, \beta)$, $\alpha < \beta$; if $j \ne \alpha$
then $D_j^j = 0$ and if $j = \alpha$ then $D_\beta^j = 0$. Now assume the
assertion is true whenever $r < u$, and $D = t_u \cdots t_1 B$. Find i such
that $(t_{u-1} \cdots t_1 B)_i^j = 0$ and let $t_u = (\alpha, \beta)$,
$\alpha < \beta$. As above, if $i \ne \alpha$ then $D_i^j = 0$
and if $i = \alpha$ then $D_\beta^j = 0$. Induction on r completes the proof.


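Given $D \in \mathcal{C}_n$, Lemma 3 provides the machinery for associating
$D^j$ in a natural way with an element $D'$ of $\mathcal{C}_{n-1}$: making the
initial association $X_i \to X_{i-1}$ in $B_n$ and $i \to i - 1$ in $T_n$ for
$i > j$, define $D' = t_r' \cdots t_1' B_{n-1}$ where if
$t_m = (\alpha, \beta)$, $\alpha < \beta$, then

$$t_m' = \begin{cases} t_m & \text{if } (t_{m-1} \cdots t_1 B_n)_i^j \ne 0 \text{ for } i = \alpha, \beta, \\ \text{identity} & \text{otherwise} \end{cases}$$

for $1 \le m \le r$. It is clear that $D'$ represents a collapsing of $D^j$ at
a coordinate i where $D_i^j = 0$ plus a permutation $\pi$ of the other
$D_k^j$'s:

$$D' = (D_{\pi(1)}^j, D_{\pi(2)}^j, \ldots, D_{\pi(n-1)}^j).$$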

However, the extent of possible permuting is limited by the
completeness of the order $\le$ on the $D_i^j$'s as is demonstrated in the
next two lemmas, which apply for $1 \le i, j, k \le n$.


Lemma 4: $D \in \mathcal{C}_n$, $D_i \le D_j \Rightarrow j \le i$.


Proof: It suffices to note that an application of a transposition to a
member of $\mathcal{C}_n$ preserves the order of the indices.


Lemma 5: $D_i \le D_k \Rightarrow D_i^j \le D_k^j$.


Proof: Writing $D_i = D_i^j + X_j g$ and $D_k = D_k^j + X_j h$, obtain
$D_k^j + X_j h = D_k = D_i + D_k = D_i^j + D_k^j + X_j(g + h)$, which by
Lemma 2 implies that $D_k^j = D_i^j + D_k^j$, that is, $D_i^j \le D_k^j$.


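It follows from Lemmas 1, 3, 4 and 5 that if $C \in \mathcal{C}_n$ is
terminal, then $C^j = (C_1^j, C_2^j, \ldots, C_{n-1}^j, 0)$ and $C'$ is
terminal in $\mathcal{C}_{n-1}$ for $1 \le j \le n$.


Theorem: $C \in \mathcal{C}_n$ is terminal $\Leftrightarrow$ $C_i = S_i$ $(1 \le i \le n)$.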
Proof: $\Leftarrow$. This direction is clear.


$\Rightarrow$. By induction on n: if n = 1 then $\mathcal{C}_1 = \{B\}$ and
$B = (X_1)$ so the assertion holds. Now assume the assertion holds for
$n < k$, and let $C \in \mathcal{C}_k$ be terminal. Then each $C'$ is terminal
in $\mathcal{C}_{k-1}$ and hence by the induction hypothesis each
$C_i^j = S_i^j$ $(i = 1, \ldots, k-1;\ j = 1, \ldots, k)$.

In particular then $C_i \ne X_1 X_2 \cdots X_k$ for $i = 1, \ldots, k-1$.
Furthermore, each $C_i$ can be expressed as $C_i = P_1 + \cdots + P_r$ where
each $P_m$ is a product of some but not all of the $X_j$'s. It follows for
$i < k$ that

$$C_i^j = \sum_{X_j \nmid P_m} P_m, \quad \text{and consequently} \quad C_i = \sum_{j=1}^{k} C_i^j = \sum_{j=1}^{k} S_i^j = S_i.$$

It is left to the reader to show that $C_k = S_k$ and thus complete the
induction argument.
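For small n the theorem can be checked by exhaustive search. The sketch below is only a numerical illustration, not a substitute for the proof: it again assumes the truth-table model (bitwise OR for +, bitwise AND for the product), enumerates every n-tuple reachable from B by products of transpositions, and lists those that every transposition leaves fixed. The function names are hypothetical.

```python
from itertools import combinations


def reachable_tuples(n):
    """All n-tuples obtainable from B = (X_1, ..., X_n) by products of transpositions."""
    size = 2 ** n
    start = tuple(
        sum(1 << a for a in range(size) if (a >> i) & 1) for i in range(n)
    )
    pairs = list(combinations(range(n), 2))  # 0-based index pairs (i, j), i < j
    seen = {start}
    frontier = [start]
    while frontier:
        next_frontier = []
        for f in frontier:
            for i, j in pairs:
                g = list(f)
                g[i], g[j] = f[i] | f[j], f[i] & f[j]
                g = tuple(g)
                if g not in seen:
                    seen.add(g)
                    next_frontier.append(g)
        frontier = next_frontier
    return seen, pairs


def terminal_elements(n):
    """Reachable tuples fixed by every transposition."""
    seen, pairs = reachable_tuples(n)
    return [
        f for f in seen
        if all((f[i] | f[j]) == f[i] and (f[i] & f[j]) == f[j] for i, j in pairs)
    ]


def symmetric_tuple(n):
    """(S_1, ..., S_n) as truth tables: S_k is true when at least k X's are true."""
    size = 2 ** n
    return tuple(
        sum(1 << a for a in range(size) if bin(a).count("1") >= k)
        for k in range(1, n + 1)
    )


for n in range(1, 5):
    print(n, terminal_elements(n) == [symmetric_tuple(n)])
```

The fixed-point test inside `terminal_elements` is the condition of Lemma 1 written out: for i < j the sum of the two coordinates must equal the first and their product must equal the second.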